RECOMBINANT PROTEINS DERIVED FROM HGF AND MSP

Field of the invention 

The present invention relates to recombinant proteins obtained from
the combination of structural domains derived from the α subunits of
hepatocyte growth factor (HGF) and macrophage stimulating protein
(MSP).

In particular, the engineered factors of the invention are obtained by
combining the hairpin loop and kringle domains of the α chains of
HGF and/or MSP, so as to obtain a structure having two superdomains
with an intervening linker sequence. Moreover, the invention relates to
DNA sequences encoding the above-mentioned recombinant proteins, to
the expression vectors comprising said DNA sequences and to host cells
containing said expression vectors. The recombinant proteins of the
present invention are biologically active, and their activity can be
measured by determining their ability to induce activation of the Met
tyrosine kinase receptor, their "scattering" effect on epithelial cells, and
their protective effect against cell death induced by chemotherapeutic drugs
(vide infra). Therefore, these molecules can conveniently be used to
prevent or treat the toxic side effects of the chemotherapeutic treatment
of tumours, and to reduce iatrogenic cell damage induced by other types of
drugs.

Technological background 

Hepatocyte Growth Factor (HGF) and Macrophage Stimulating
Protein (MSP) are highly related proteins, both structurally and
functionally (Fig. 1 and 2). Both factors are secreted as an inactive
precursor, which is processed by specific proteases that recognise a
cleavage site inside the molecule, dividing the protein into two subunits.
These subunits, named α chain and β chain, are linked by a disulphide
bond. Thus, the mature factor is an α-β dimeric protein. Only the mature
(dimeric) form of the factor is able to activate its receptor at the surface of
the target cells (the Met tyrosine kinase in the case of HGF and the Ron
tyrosine kinase in the case of MSP) and therefore to mediate biological
responses (Naldini, L. et al., 1992, EMBO J. 11: 4825-4833; Wang, M. et
al., 1994, J. Biol. Chem. 269: 3436-3440; Bottaro, D. et al., 1991, Science
251: 802-804; Naldini, L. et al., 1991, EMBO J. 10: 2867-2878; Wang, M.
et al., 1994, Science 266: 117-119; Gaudino, G. et al., 1994, EMBO J. 13:
3524-3532).

The α chain of both factors contains a hairpin loop (HL) structure
and four domains with a tangle-like structure named kringles (K1-K4;
Nakamura, T. et al., 1989, Nature 342: 440-443; Han, S. et al., 1991,
Biochemistry 30: 9768-9780). The precursor also contains a signal
sequence (LS) of 31 amino acids (in the case of HGF) or of 18 amino
acids (in the case of MSP), removed in the rough endoplasmic
reticulum, which directs the nascent peptide to the secretory
pathway. The β chain contains a box with a sequence homologous
to that typical of serine proteases, but it has no catalytic activity
(Nakamura, T. et al., 1989, Nature 342: 440-443; Han, S. et al., 1991,
Biochemistry 30: 9768-9780). Both α and β chains contribute to the
binding of the growth factor to the respective receptor (Met for HGF and
Ron for MSP).

HGF and MSP polypeptides are able to induce a variety of
biological effects besides cell proliferation. The main biological activities
of these molecules are: stimulation of cell division (mitogenesis);
stimulation of motility (scattering); induction of polarisation and cell
differentiation; induction of tubule formation (branched morphogenesis);
increase of cell survival (protection from apoptosis). The tissues that
respond to HGF and MSP stimulation are those where cells express the
respective Met (HGF) and Ron (MSP) receptors. The most important
target tissues of these factors are epithelial cells of different organs, such
as liver, kidney, lung, breast, pancreas and stomach, and some cells
of the hematopoietic and nervous systems. A detailed review of the
biological effects of HGF and MSP in the various tissues can be
found in Tamagnone, L. & Comoglio, P., 1997, Cytokine & Growth Factor
Reviews, 8: 129-142, Elsevier Science Ltd.; Zarnegar, R. &
Michalopoulos, G., 1995, J. Cell Biol. 129: 1177-1180; Medico, E. et
al., 1996, Mol. Biol. Cell, 7: 495-504; Banu, N. et al., 1996, J.
Immunol. 156: 2933-2940.

In the case of HGF, the hairpin loop and the first two
kringles are known to contain the sites of direct interaction with the
Met receptor (Lokker, N.A. et al., 1992, EMBO J. 11: 2503-2510;
Lokker, N. et al., 1994, Protein Engineering 7: 895-903). Two
naturally occurring truncated forms of HGF, produced by some cells
by alternative splicing, have been described. The first one comprises
the first kringle (NK1-HGF; Cioce, V. et al., 1996, J. Biol. Chem.
271: 13110-13115), whereas the second one extends to the second
kringle (NK2-HGF; Miyazawa, K. et al., 1991, Eur. J. Biochem. 197:
15-22). NK2-HGF induces cell scattering, but it is not mitogenic as
the complete growth factor is (Hartmann, G. et al., 1992, Proc.
Natl. Acad. Sci. USA 89: 11574-11578). However, NK2-HGF
regains mitogenic activity in the presence of heparin, a
glycosaminoglycan that binds HGF through a domain contained in
the first kringle and which is likely to induce dimerization of
NK2-HGF (Schwall, R. et al., 1996, J. Cell Biol. 133: 709-718). Moreover,
NK2-HGF, being a partial agonist of Met, behaves as a competitive
inhibitor of HGF as far as the mitogenic activity is concerned (Chan, A. et
al., 1991, Science 254: 1382-1385). NK1-HGF has also been described to
exert partial stimulation of Met and competitive inhibition of HGF
mitogenic activity (Cioce, V. et al., 1996, J. Biol. Chem. 271:
13110-13115). In any case, a truncated factor has an activity markedly
lower than that of the recombinant factors described in the invention, as
shown in example 3.

In the case of MSP, the interaction sites with the Ron receptor are
less well understood: some preliminary studies suggest a situation opposite
to that of HGF, i.e. the β chain directly binds the receptor whereas the α
chain would act by stabilizing the complex (Wang, M.H. et al., 1997, J. Biol.
Chem. 272: 16999-17004).

The therapeutic use of molecules such as HGF and MSP is
potentially valuable in a wide range of pathologies (Abdulla, S.,
1997, Mol. Med. Today 3: 233). Nevertheless, a number of technical
as well as biological complications make the clinical application of
these molecules difficult. First of all, the pleiotropic character
of these factors can cause poorly selective biological responses,
which involve undesired side effects. For example, the use of HGF
to prevent some side effects of the chemotherapeutic drug cisplatin
has been proposed (Kawaida, K. et al., 1994, Proc. Natl. Acad. Sci.
91: 4357-4361). Cancer patients treated with this drug can suffer
acute kidney damage due to the cytotoxic action of cisplatin on
proximal tubule epithelial cells. HGF is able to protect these cells
against programmed death (apoptosis) induced by cisplatin, but at
the same time it can induce an undesired proliferation of neoplastic
cells. Other problems related to the pharmaceutical use of HGF and
MSP are the need for their proteolytic activation and their poor
stability, both of which cause technical problems. The NK1 and NK2
truncated forms of HGF do not require proteolytic activation, but
they have a reduced biological activity.

Summary of the invention

The present invention provides recombinant molecules
composed of a combination of structural domains derived from the
α chains of HGF and/or MSP, which overcome the problems of the
prior art molecules described above. The molecules of this invention
are composed of two superdomains connected by a linker. Each
superdomain is composed of a combination of the HL and K1-K4
domains of the α chain of HGF and/or MSP. These engineered
factors induce selective biological responses, do not require
proteolytic activation, are stable and are more active than the truncated
forms of HGF described previously.

Detailed disclosure of the invention

The present invention relates to recombinant proteins (which will be
hereinafter referred to interchangeably as proteins, molecules, engineered or
recombinant factors) characterised by a structure that comprises two
superdomains, each consisting of a combination of HL and K1-K4
domains derived from the α chain of HGF and/or MSP, linked by a spacer
sequence or a linker. In particular, the invention relates to proteins of
general formula (I)

[A] - B - [C] - (D)y (I) 

in which 

[A] corresponds to the sequence (LS)m-HL-K1-(K2)n-(K3)o-(K4)p
wherein (the numbering of the following amino acids refers to the HGF
and MSP sequences as reported in Fig. 1 and 2, respectively):

LS is an amino acid sequence corresponding to residues 1-31 of HGF or
1-18 of MSP;

HL is an amino acid sequence derived from the α chain of HGF starting
between residues 32-70 and ending between residues 96-127; or it is an
amino acid sequence derived from the α chain of MSP starting between
residues 19-56 and ending between residues 78-109;

K1 is an amino acid sequence derived from the α chain of HGF starting
between residues 97-128 and ending between residues 201-205; or it is an
amino acid sequence derived from the α chain of MSP starting between
residues 79-110 and ending between residues 186-190;

K2 is an amino acid sequence derived from the α chain of HGF starting
between residues 202-206 and ending between residues 283-299; or it is
an amino acid sequence derived from the α chain of MSP starting between
residues 187-191 and ending between residues 268-282;

K3 is an amino acid sequence derived from the α chain of HGF starting
between residues 284-300 and ending between residues 378-385; or it is
an amino acid sequence derived from the α chain of MSP starting between
residues 269-283 and ending between residues 361-369;

K4 is an amino acid sequence derived from the α chain of HGF starting
between residues 379-386 and ending between residues 464-487; or it is
an amino acid sequence derived from the α chain of MSP starting between
residues 362-370 and ending between residues 448-481;

m, n, o, p can be 0 or 1;

the sum n + o + p is an integer from 1 to 3 or 0, with the proviso that
n ≥ o ≥ p;

B is the sequence [(X)q Y]r, wherein X = Gly and Y = Ser, or Cys, or Met,
or Ala;

q is an integer from 2 to 8;

r is an integer from 1 to 9;

[C] corresponds to the sequence HL-K1-(K2)s-(K3)t-(K4)u
wherein HL, K1-K4 are as defined above,

s, t, u are 0 or 1; the sum s + t + u is an integer from 1 to 3 or 0, with the
proviso that s ≥ t ≥ u;

D is the sequence W-Z, wherein W is a conventional proteolytic site, Z is
any tag sequence useful for the purification and detection of the protein; y
is 0 or 1.

Non-limiting examples of W are consensus sequences for 
enterokinase protease, thrombin, factor Xa and IgA protease. 
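
The constraints of general formula (I) can be illustrated with a minimal Python sketch. The class and function names below are hypothetical and are not part of the disclosure; the sketch reads the proviso as n ≥ o ≥ p (a kringle is present only if the preceding one is present) and writes the linker B in one-letter amino acid code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Superdomain:
    """Flags for the optional kringles in HL-K1-(K2)-(K3)-(K4).

    HL and K1 are always present; k2, k3, k4 play the role of
    n, o, p (or s, t, u) in general formula (I). Hypothetical helper.
    """
    k2: int = 0
    k3: int = 0
    k4: int = 0

    def is_valid(self) -> bool:
        flags = (self.k2, self.k3, self.k4)
        if any(f not in (0, 1) for f in flags):
            return False
        # Proviso read as n >= o >= p: no kringle may follow a missing one.
        return self.k2 >= self.k3 >= self.k4

    def domains(self) -> list[str]:
        names = ["HL", "K1"]
        for name, flag in (("K2", self.k2), ("K3", self.k3), ("K4", self.k4)):
            if flag:
                names.append(name)
        return names


def linker(q: int, r: int, y: str = "S") -> str:
    """Build B = [(Gly)q Y]r in one-letter code, Y in {Ser, Cys, Met, Ala}."""
    if not 2 <= q <= 8:
        raise ValueError("q must be an integer from 2 to 8")
    if not 1 <= r <= 9:
        raise ValueError("r must be an integer from 1 to 9")
    if y not in ("S", "C", "M", "A"):
        raise ValueError("Y must be Ser (S), Cys (C), Met (M) or Ala (A)")
    return ("G" * q + y) * r


print(Superdomain(k2=1).domains())   # HL, K1 and K2 only
print(linker(q=4, r=3))              # a (Gly4Ser)3 linker
```

Under this reading, a superdomain such as HL-K1-K3 (K2 absent, K3 present) is rejected, while HL-K1, HL-K1-K2, HL-K1-K2-K3 and HL-K1-K2-K3-K4 are accepted.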

Preferred proteins of general formula (I) are those in which:
the HL domain is a sequence of HGF α chain ranging from amino
acids 32 to 127, or a sequence of MSP α chain ranging from amino acids
19 to 98; the K1 domain is a sequence of HGF α chain ranging from
amino acids 128 to 203, or a sequence of MSP α chain ranging from
amino acids 99 to 188; the K2 domain is a sequence of HGF α chain
ranging from amino acids 204 to 294, or a sequence of MSP α chain
ranging from amino acids 189 to 274; the K3 domain is a sequence of
HGF α chain ranging from amino acids 286 to 383, or a sequence of MSP
α chain ranging from amino acids 275 to 367; the K4 domain is a
sequence of HGF α chain ranging from amino acids 384 to 487, or a
sequence of MSP α chain ranging from amino acids 368 to 477.

Among the possible combinations of the domains of general
formula (I), the following (II) and (III) are preferred, concerning two
recombinant factors named Metron Factor-1 and Magic Factor-1,
respectively:

LSmsp-HLmsp-K1msp-K2msp-L-HLhgf-K1hgf-K2hgf-D (Metron Factor-1) (II)

and

LShgf-HLhgf-K1hgf-K2hgf-L-HLhgf-K1hgf-K2hgf-D (Magic Factor-1) (III)

For both molecules, L is the linker sequence (Gly4Ser)3 and D is the tag
sequence Asp4-Lys-His6.

For Metron Factor-1, LSmsp is the sequence 1-18 of MSP, HLmsp is
the sequence 19-56 of MSP, K1msp is the sequence 99-188 of MSP, K2msp
is the sequence 189-274 of MSP, HLhgf is the sequence 32-127 of HGF,
K1hgf is the sequence 128-203 of HGF, K2hgf is the sequence 204-294 of
HGF.

For Magic Factor-1, HLhgf, K1hgf, K2hgf are as defined above, and
LShgf is the sequence 1-31 of HGF.
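
As an illustration of how formula (III) maps onto residue ranges, the following Python sketch assembles a Magic Factor-1-like sequence from 1-based, inclusive ranges of a parent sequence. The helper names are hypothetical, and the parent sequence used here is only a placeholder of suitable length, not the HGF sequence of Fig. 1; the linker and tag are written in one-letter code from their definitions, (Gly4Ser)3 and Asp4-Lys-His6.

```python
LINKER = "GGGGS" * 3               # (Gly4Ser)3
TAG = "D" * 4 + "K" + "H" * 6      # Asp4-Lys-His6


def segment(parent: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a parent sequence."""
    if not 1 <= start <= end <= len(parent):
        raise ValueError(f"range {start}-{end} outside parent sequence")
    return parent[start - 1:end]


# Residue ranges stated for Magic Factor-1 (all taken from HGF).
MAGIC_A = [("LS", 1, 31), ("HL", 32, 127), ("K1", 128, 203), ("K2", 204, 294)]
MAGIC_C = [("HL", 32, 127), ("K1", 128, 203), ("K2", 204, 294)]


def assemble(parent: str, part_a, part_c, with_tag: bool = True) -> str:
    """Concatenate [A] - linker - [C] - (tag) from residue ranges."""
    a = "".join(segment(parent, s, e) for _, s, e in part_a)
    c = "".join(segment(parent, s, e) for _, s, e in part_c)
    return a + LINKER + c + (TAG if with_tag else "")


# Placeholder parent sequence (assumption): any string of >= 294 residues.
placeholder_hgf = "M" + "A" * 299
magic = assemble(placeholder_hgf, MAGIC_A, MAGIC_C)
print(len(magic))
```

With these ranges the precursor comprises 294 residues for [A], 15 for the linker, 263 for [C] and 11 for the tag.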

The hybrid molecules of the invention are prepared by genetic
engineering techniques according to a strategy involving the following
steps:

a) construction of DNA encoding the desired protein;

b) insertion of DNA in an expression vector;

c) transformation of a host cell with recombinant DNA (rDNA);

d) culture of the transformed host cell so as to express the recombinant
protein;

e) extraction and purification of the produced recombinant protein.

The DNA sequences corresponding to HGF or MSP structural
domains can be obtained by synthesis or starting from DNA encoding
the two natural factors. For example, screening of cDNA libraries can be
carried out using suitable probes, so as to isolate HGF or MSP cDNA.
Alternatively, HGF or MSP cDNA can be obtained by reverse
transcription from purified mRNA from suitable cells.

cDNAs coding for the fragments of HGF and MSP β chains can be
amplified by PCR (Mullis, K.B. and Faloona, F.A., 1987, Methods in
Enzymol. 155, 335-350), and the amplification products can be
recombined making use of suitable restriction sites, naturally occurring in
the factor sequences or artificially introduced in the oligonucleotide
sequence used for the amplification.

In greater detail, one of the above-mentioned strategies can be the
following:

the portions of DNA encoding the LS, HL, K1, K2, K3 and K4 domains
are amplified by PCR from HGF or MSP cDNA and then recombined to
obtain the hybrid sequences corresponding to [A] and [C].
Oligonucleotides recognising sequences located at the two ends of the
domains to be amplified are used as primers. Primers are designed so as
to contain a sequence allowing recombination between the DNA of a
domain and the adjacent one. Said recombination can be carried out by
endonuclease cleavage and subsequent ligase reaction, or making use of
the recombinant PCR method (Innis, M.A. et al., 1990, in PCR Protocols,
Academic Press, 177-183).

The sequence encoding the domain B (linker) can be obtained by
synthesis of a double-stranded oligonucleotide, which can be inserted
between [A] and [C] using suitable restriction sites.

The resulting three fragments encoding [A], B and [C] are then
inserted in the correct sequence in a suitable vector. In this step it can be
decided whether or not to add the domain D (tag), obtained by synthesis
analogously to domain B, downstream of fragment [C].

The recombinant expression vector can contain, in addition to the
recombinant construct, a promoter, a ribosome binding site, an initiation
codon, a stop codon, optionally a consensus site for expression enhancers.

The vector can also comprise a selection marker for isolating the
host cells containing the DNA construct. Yeast or bacteria plasmids, such
as plasmids suitable for Escherichia coli, can be used as vectors, as well
as bacteriophages, viruses, retroviruses, or DNA.

The vectors are preferably cloned in bacterial cells, for example in
Escherichia coli, as described in Sambrook, J. et al., 1989, Molecular Cloning,
Cold Spring Harbor Laboratory Press, New York, and the colonies can be
selected, for example, by hybridisation with radiolabelled oligonucleotide
probes; subsequently, the rDNA sequence extracted from the positive
colonies is determined by known methods.

The vector with the recombinant construct can be introduced into the
host cell according to the competent cell method, the protoplast method,
the calcium phosphate method, the DEAE-dextran method, the electric
impulses method, the in vitro packaging method, the viral vector method,
the micro-injection method, or other suitable techniques.

Host cells can be prokaryotic or eukaryotic, such as bacteria, yeasts
or mammalian cells, and they will be such as to effectively produce the
recombinant protein.

After transformation, cells are grown in a suitable medium, which
can be for example MEM, DMEM or RPMI 1640 in the case of mammalian
host cells.

The recombinant protein is secreted in the culture medium, from
which it can be recovered and purified with different methods, such as
size exclusion, adsorption, affinity chromatography, salting-out,
precipitation, dialysis, ultrafiltration.

A simple, rapid system for the production of the molecules of the
invention is, for example, transient expression in mammalian cells.

Accordingly, the plasmid containing the recombinant DNA
fragment, for example pMT2 (Sambrook, J. et al., 1989, Molecular
Cloning, Cold Spring Harbor Laboratory Press), is transfected in suitable
recipient cells, such as Cos7 (Sambrook, J. et al., supra) by the calcium
phosphate technique or other equivalent techniques. Some days after
transfection, the conditioned medium of the transfected cells is collected,
cleared by centrifugation and analysed for its factor content. For this
analysis, antibodies directed against HGF or MSP, or against any tag
sequence, can be used: the supernatant is immunoprecipitated and then
analysed by western blot with the same antibody. The supernatant
containing the recombinant factor can also be used directly for
biochemical and biological tests. The protein can be purified, for example,
using a poly-histidine tag sequence, by adsorption on a nickel resin
column and subsequent elution with imidazole.

The biochemical properties of the recombinant factors of the
invention were tested in connection with their ability to activate Met and
Ron receptors.

Sub-micromolar concentrations of the factors have proved to induce
tyrosine phosphorylation of Met in human epithelial cells A549, whereas
they do not induce phosphorylation above basal values in cells expressing
Ron. On the whole, the tests proved that the first two kringles of HGF
maintain their ability to interact with and to activate the Met tyrosine
kinase receptor, whereas the corresponding first two kringles of MSP are not
sufficient for modulating the catalytic activity of the Ron receptor.
However, the interaction with Ron, although at low affinity, can contribute
to the recruitment of the factor at the cell surface, playing a role similar to
that of the low affinity receptors (of mature glycoprotein) which recruit the
intact HGF molecule through the heparin-binding domain.

The molecules of the invention have a marked biological activity,
measured by the scattering tests, and a protecting activity against cell
apoptosis induced by cisplatin or etoposide.

In particular, the supernatant containing the recombinant factor has
been found to promote scattering of epithelial cells of various nature even
at nanomolar concentrations. In these tests, kidney epithelial cells
(MDCK) or hepatocyte precursors (MLP29) were used.

In an in vitro experimental system, in which DNA fragmentation
typical of apoptotic cells is evaluated by the TUNEL method (Gavrieli, Y.
et al., 1992, J. Cell. Biol. 117, 493-501), the recombinant factors protect
against apoptosis induced by chemotherapeutic drugs at levels comparable
with HGF and remarkably higher than MSP. The engineered molecules
proved to be active on human primary epithelial cells from proximal
tubule (PTECs), on an immortalised PTECs line (Loc) and on the already
cited murine hepatocytes MLP29.

Among the applications of the recombinant molecules of the
invention, the following can be cited:

- prevention of myelotoxicity; in particular they can be used for the
  expansion of marrow precursors, to increase proliferation of the
  hematopoietic precursors or to stimulate their entry into the circulation;
- prevention of liver and kidney toxicity, and of mucositis following
  antineoplastic treatments; in particular the recombinant factors can
  be used to prevent toxicity (apoptosis) on differentiated cell
  elements of liver, kidney and mucosa of the gastrointestinal tract, and
  to stimulate stem cells of the skin and mucosae to allow the
  regeneration of germinative layers;
- prevention of chemotherapeutic neurotoxicity.

In general, the proteins of the invention provide the following
advantages, compared with the parent molecules HGF and MSP:

- they are smaller molecules with a more compact structure;
- they are more stable and are produced in higher amounts;
- they do not require the endoproteolytic cleavage that
  transforms the HGF and MSP precursors into the respective active
  forms;
- they can be engineered in combinations of different functional
  domains, thereby modulating the biological effects, increasing the
  favourable ones and reducing the undesired ones (for example,
  protection from apoptosis versus cell proliferation).

The invention is also to be considered as directed to amino acid and
nucleotide sequences referred to formula (I), having modifications which
can, for example, derive from the degeneracy of the genetic code, without
therefore modifying the amino acid sequence, or from the deletion,
substitution, insertion, inversion or addition of nucleotides and/or bases
according to all the possible methods known in the art.

Furthermore, the invention relates to the expression vectors
comprising a sequence encoding a protein of general formula (I),
which can be plasmids, bacteriophages, viruses, retroviruses, or others,
and to host cells containing said expression vectors.

Finally, the invention relates to the use of the recombinant proteins
as therapeutic agents, and to pharmaceutical compositions containing an
effective amount of the recombinant proteins together with
pharmacologically acceptable excipients.

Description of the Figures

(In the following legends, -His located after the name of the parent
factors, truncated or recombinant, or of the plasmids, means that the
respective sequences contain a poly-histidine tag).

Figure 1:

a) Nucleotide and amino acid sequence of human HGF (GenBank
# M73239; Weidner, K.M., et al., 1991, Proc. Natl. Acad. Sci. USA,
88: 7001-7005). In contrast to the cited reference, in the numbering used
herein, nucleotide No. 1 is the first base of the initiation codon (the A of
the first ATG). The first amino acid is the corresponding methionine. The
cDNA untranslated regions at 5' and 3' are neither represented nor
considered in the numbering.

b) Nucleotide and amino acid sequence of human MSP (GenBank
# L11924; Yoshimura, T., et al., 1993, J. Biol. Chem., 268:
15461-15468). In contrast to the cited reference, in the numbering used herein
nucleotide No. 1 is the first base of the initiation codon (the A of the first
ATG). The first amino acid is the corresponding methionine. The cDNA
untranslated regions at 5' and 3' are neither represented nor considered in
the numbering.
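
The numbering convention used in Figure 1 (nucleotide No. 1 is the A of the first ATG, amino acid No. 1 is the initiator methionine) implies a simple correspondence between residue numbers and nucleotide positions. A minimal Python sketch, with hypothetical function names, is:

```python
def codon_span(residue: int) -> tuple[int, int]:
    """Nucleotide positions (1-based, inclusive) of the codon for a residue,
    counting from the A of the initiation codon as nucleotide No. 1."""
    if residue < 1:
        raise ValueError("residue numbering starts at 1")
    first = 3 * (residue - 1) + 1
    return first, first + 2


def residue_of(nucleotide: int) -> int:
    """Residue number whose codon contains the given nucleotide position."""
    if nucleotide < 1:
        raise ValueError("nucleotide numbering starts at 1")
    return (nucleotide - 1) // 3 + 1


print(codon_span(1))    # the initiation codon
print(codon_span(32))   # first residue after a 31-residue signal sequence
```

For example, under this convention the initiator methionine is encoded by nucleotides 1-3, and residue 32 by nucleotides 94-96.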

Figure 2:

a) Molecular structure of Metron Factor-1. The leader sequence
is removed by the cells used for the production before secretion and is
therefore absent in the mature molecule. The poly-histidine tag can be
removed by digestion with the protease enterokinase.

b) Nucleotide and amino acid sequence of Metron Factor-1. The
nucleotide sequence starts with the EcoRI site and terminates with the SalI
site (first six bases and last six bases, respectively). The initiation codon
(ATG) and the stop codon (TAG) are underlined.

Figure 3:

a) Molecular structure of Magic Factor-1. The leader sequence
is removed by the cells used for the production before secretion and is
therefore absent in the mature molecule. The poly-histidine tag can be
removed by digestion with the protease enterokinase.

b) Nucleotide and amino acid sequence of Magic Factor-1. The
nucleotide sequence starts with the SalI site (first six bases and last six
bases, respectively). The initiation codon (ATG) and the stop codon
(TAG) are underlined.

Figure 4:

Production of Metron-F-1 by transient transfection of mammalian
cells. The conditioned supernatants from BOSC cells transfected with the
control plasmid (CTRL) or with pRK7-Metron-F-1-His were
immunoprecipitated with an anti-MSP polyclonal antibody and detected
by western blot with the same antibody.

Figure 5:

Quantitation of the recombinant proteins by western blot. (A) The
proteins were adsorbed on Sepharose-A-heparin beads and detected with
an anti-poly-histidine monoclonal antibody. (B) The proteins were
immunoprecipitated with an anti-MSP polyclonal antibody and detected
with an anti-poly-histidine monoclonal antibody.

Figure 6:

Scattering test carried out on kidney epithelial cells (MDCK) using
the recombinant proteins prepared by transient transfection. The protein
content was quantified by western blot (see Fig. 5). (A) non-stimulated
cells; (B) cells stimulated with control supernatant; (C) cells stimulated
with HGF-His; (D) cells stimulated with NK2-HGF-His; (E) cells
stimulated with Metron Factor-1; (F) cells stimulated with Magic Factor-1.

Figure 7:

Activation (phosphorylation) of Met receptor by the hybrid factor
Metron Factor-1. Human epithelial cells (A549) were stimulated with
supernatants conditioned from BOSC cells transfected with the control
plasmid (CTRL) or with pRK7-Metron-F-1-His (METRON F-1) at the
indicated dilutions. Cell lysates from the stimulated cells were
immunoprecipitated with an anti-Met monoclonal antibody and detected
by western blot with an anti-phosphotyrosine monoclonal antibody.

Figure 8:

Protective effect of Metron-F-1 against acute renal failure induced
by HgCl2 in vivo. Balb-c mice were injected i.v. with Metron-F-1 or
vehicle at 0.5 h before and 6, 12, 24, 36 and 48 h after HgCl2 i.v.
administration. BUN and histological evaluation of renal necrosis were
measured at 72 h.

Data expressed as mean ± s.e. of 7 animals/group (BUN) or 3
animals/group (histology).

The following examples illustrate the invention in greater detail.

Example 1a: Preparation of the recombinant construct encoding
Metron Factor-1

HGF cDNA was obtained by the RT-PCR technique (Reverse
Transcriptase PCR; in: Innis, M. A., et al., 1990, PCR Protocols,
Academic Press, 21-27) from a human lung fibroblast cell line (MRC5;
Naldini, L. et al., 1991, EMBO J. 10: 2867-2878). MSP cDNA was
obtained with the same technique from human liver (Gaudino, G., et al.,
1994, EMBO J. 13: 3524-3532).

The fragment corresponding to MSP LS-HL-K1-K2 was amplified
by PCR using MSP cDNA as template and the following oligonucleotides
as primers:

P1 (sense)

5' CGGGCGGAATTCGAGGATGGGGTGGGTGGGAGTGGT 3'

P2 (antisense)

5' CGGGGGCTCGAGGGGGGGGTGTGCCTGGGAGCGGGA 3'

in which the underlined palindromic sequences (GAATTC and CTCGAG) are the
restriction sites for the enzymes EcoRI (oligonucleotide P1) and XhoI
(oligonucleotide P2). The PCR product was digested with the restriction
enzymes EcoRI and XhoI and then purified by electrophoresis on agarose gel.

The fragment corresponding to HL-K1-K2 of HGF was amplified
by PCR using HGF cDNA as template and the following oligonucleotides
as primers:

P3 (sense)

5' GGGGCGTCTAGAGGGACAAAGGAAAAGAAGAAATAC 3'

P4 (antisense)

5' GGGGGGAAGCTTTGTGAGCGGATGTTTTAATTGGAC 3'

in which the underlined palindromic sequences (TCTAGA and AAGCTT) are the
restriction sites for the enzymes XbaI (oligonucleotide P3) and HindIII
(oligonucleotide P4). The PCR product was digested with the restriction
enzymes XbaI and HindIII and then purified by electrophoresis on agarose gel.
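
The placement of the restriction sites in primers P1 to P4 can be checked with a short Python sketch. The function names are hypothetical; the primer sequences are copied as printed in this text (and may carry transcription errors outside the restriction sites), and the recognition sequences used are the standard ones for EcoRI, XhoI, XbaI and HindIII.

```python
# Standard recognition sequences of the four enzymes named in the text.
SITES = {
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
    "XbaI": "TCTAGA",
    "HindIII": "AAGCTT",
}

# Primers as printed above, each paired with the enzyme assigned to it.
PRIMERS = {
    "P1": ("CGGGCGGAATTCGAGGATGGGGTGGGTGGGAGTGGT", "EcoRI"),
    "P2": ("CGGGGGCTCGAGGGGGGGGTGTGCCTGGGAGCGGGA", "XhoI"),
    "P3": ("GGGGCGTCTAGAGGGACAAAGGAAAAGAAGAAATAC", "XbaI"),
    "P4": ("GGGGGGAAGCTTTGTGAGCGGATGTTTTAATTGGAC", "HindIII"),
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def is_palindromic(site: str) -> bool:
    """A site is palindromic if it equals its own reverse complement."""
    return site == reverse_complement(site)


def site_position(primer: str, site: str):
    """1-based position of the first occurrence of site, or None if absent."""
    index = primer.find(site)
    return index + 1 if index >= 0 else None


for name, (sequence, enzyme) in PRIMERS.items():
    site = SITES[enzyme]
    print(name, enzyme, site, is_palindromic(site), site_position(sequence, site))
```

In each of the four primers as printed, the site begins at position 7, i.e. it is preceded by six 5' bases and followed by the portion that anneals to the template.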

For the linker sequence, the following partially complementary 
oligonucleotides were synthesised, and were subsequently annealed to 
obtain a double strand DNA fragment with sticky ends: 
P5 (sense) 

5'TOGAGGGGGGTGGCGGTKnGGTGGCGGTGGCK^^ 
P6 (antisense) 

5- GIAGAGAAa3GCGAGa3CajGAGa:AGa3GCACCAGAAGC^^ 3' 
in which the underlined bases are the sequences compatible with the
restriction sites for the enzymes XhoI (oligonucleotide P5) and XbaI
(oligonucleotide P6).

The resulting three DNA fragments were subcloned in the
EcoRI-HindIII sites of the expression vector pRK7 (Gaudino, G., et al., 1994,
EMBO J. 13: 3524-3532), to obtain the recombinant plasmid
pRK7-Metron-F-1, containing all the components of Metron Factor-1 except the
tag sequence.

For the insertion of the tag sequence, the following partially 
complementary oligonucleotides were synthesised, and were subsequently 
annealed to obtain a double strand DNA fragment with sticky ends: 
P7 (sense) 

5'AGCrGACGACGA(XACAAACACCA(XACC^ 
P8 (antisense) 

5' AGQCTQGAOXTAGTGGTGGTGGTGGTGGTGGTCT^^ 3' 
in which the underlined bases are compatible with the HindIII restriction
site and the boxed palindromic sequences are the consensus sequences for
the enzyme SalI. The resulting double-stranded DNA fragment was inserted
in the restriction site HindIII of the recombinant plasmid obtained at the
previous step (destroying the HindIII site and creating the SalI site), to
obtain the plasmid pRK7-Metron-F-1-His.

Example 1b: Production of Metron Factor-1

The expression vector pRK7 contains a promoter of human
cytomegalovirus immediate-early gene (CMV) and an episomal replication
origin site of the DNA virus SV40. Therefore, this plasmid is particularly
suitable for the expression of proteins in cells expressing the large T
antigen of the virus SV40, such as kidney epithelial BOSC cells
(Sambrook, J. et al., 1989, Molecular Cloning, Cold Spring Harbor
Laboratory Press). Metron Factor-1 can then be produced by transient
transfection of plasmid pRK7-Metron-F-1-His in BOSC cells.

For transfection, 10^ cells are seeded at day 0 in a 100 mm plate in
90% Dulbecco's Modified Eagle Medium (DMEM)-10% bovine calf serum
(10 ml/plate). At day 1, cells are transfected with 10 ng/plate of
pRK7-Metron-F-1-His by lipofection, using the protocol provided by the
lipofectin producer (Gibco-BRL). At day 2, the DNA-containing medium
is replaced by fresh medium with low serum content (99.5% DMEM-
0.5% bovine calf serum). At day 4 (48 hours after the end of the
transfection), the medium is collected, cleared by centrifugation, and
analysed for its content in Metron Factor-1.

This analysis can be carried out in different ways. For example, the
recombinant protein present in the cleared supernatant can be
immunoprecipitated with an anti-MSP antibody and then detected by
western blot with the same antibody (Fig. 4). In the example shown in
figure 4, 500 µl of supernatant (cleared by centrifugation, buffered in 25
mM HEPES and supplemented with a protease inhibitor cocktail) were
immunoprecipitated (2 hours at 4° C) with 20 µl of Sepharose-A beads
(Pharmacia) covalently conjugated with 2 µl of anti-MSP polyclonal
antibody. The bead pellet was washed 3 times with 500 µl of washing
buffer (20 mM HEPES pH 7.4; 150 mM NaCl; 0.1% Triton X-100; 10%
glycerol) and heated at 90° C for 2 minutes in 100 µl of Laemmli buffer.
Eluted proteins were separated by SDS-PAGE on an 8% bis-acrylamide gel,
transferred onto a membrane (Hybond-C; Amersham) and analysed by
western blot. For this analysis, the same rabbit serum used for
immunoprecipitation was employed as primary antibody at a 1:1000
dilution, and protein A conjugated with peroxidase (Amersham) was used
as secondary antibody. Protein A was detected by ECL (Amersham)
following the protocol provided by the producer.

Alternatively, the recombinant protein can be partially purified by
adsorption on Sepharose-A beads conjugated with heparin and then
analysed by western blot using antibodies directed against the
poly-histidine tag (Fig. 5).

In the example shown in figure 5, the Sepharose-A-heparin beads
(20 µl; Pierce) were incubated (4 hours at 4° C) with 500 µl of supernatant
(cleared by centrifugation, buffered in 25 mM HEPES and supplemented with a
protease inhibitor cocktail) in the presence of 500 mM NaCl, washed
with a suitable buffer (500 mM NaCl; 20 mM HEPES pH 7.4; 0.1% Triton
X-100; 10% glycerol) and heated at 90° C for 2 minutes in 100 µl of
Laemmli buffer. Eluted proteins were separated by SDS-PAGE on an 8%
bis-acrylamide gel, transferred onto a membrane (Hybond-C; Amersham) and
analysed by western blot. For this analysis, a mouse monoclonal antibody
to poly-histidine (Invitrogen) diluted 1:5000 was used as primary antibody
and an anti-mouse IgG ovine antibody conjugated with peroxidase
(Amersham) was used as secondary antibody. The secondary antibody was
detected by ECL (Amersham) following the protocol provided by the
producer.

The procedure of adsorption on heparin beads can also be used as a
protocol for the semi-purification of the recombinant protein. Furthermore,
the molecule can be purified further by exploiting the affinity of
poly-histidine for heavy metals such as nickel. The protein containing the
poly-histidine tag can be adsorbed on a nickel resin column (Invitrogen) and
subsequently eluted with imidazole (the detailed protocol is provided by
the manufacturer).

Example 1c: METRON-F-1 production in insect cells

The cDNA encoding Metron-F-1 was subcloned in a suitable
expression vector (pFASTBAC) to generate a recombinant plasmid
containing the Metron-F-1 gene (p-FASTBAC-Metron). A competent
E. coli strain (DH10Bac) was transformed with p-FASTBAC-Metron to
generate BACMID DNA. The DNA of positive colonies was isolated and
checked by PCR to show the correct integration of the expression vector.
Subsequently, the DNA from three clones was transfected into Sf9 insect
cells with CellFECTIN reagent to produce virus particles. Virus titer was
tested by a plaque assay. Single plaques were isolated and used for further
propagation of the baculovirus. Viral stock was subsequently expanded in
insect cells to scale up METRON-F-1 production. To verify protein
expression, insect cells were infected at a multiplicity of infection
(MOI) of 1 in a small-scale reactor. Samples of supernatants were
analysed by SDS-PAGE followed by western blotting.

To produce amounts adequate for in vivo testing, insect cells were
propagated in a 2.5-liter stirred tank bioreactor. Cells were grown to a
cell density of 1 x 10^6 ml^-1 before they were infected at an MOI of 1. The
cell suspension was harvested 3 days post infection. The supernatant
containing the recombinant protein was separated by centrifugation. The
presence of Metron F-1 in the supernatant was confirmed by SDS-PAGE
followed by western blotting. Metron F-1 was pre-purified by a dual-step
affinity chromatography on heparin sepharose (heparin-HiTrap,
Pharmacia) at 6° C. For in vivo testing or for further purification steps, the
eluted fractions containing Metron F-1 were desalted by Sephadex G-25
chromatography (PD-10 or HiPrep 26/10, Pharmacia). Metron F-1 was
further purified by chromatography on HisTrap columns (Pharmacia) and
eluted by an imidazole gradient (0-0.5 M) using either a low-pressure
system (Econo System, BIO-RAD) or an FPLC system (Pharmacia).
Metron F-1 was eluted at an imidazole concentration of about 0.15 M. For
in vivo testing, the eluted fractions containing Metron F-1 were freed of
imidazole by Sephadex G-25 chromatography as already described, using
the buffer to be used for animal treatment.
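
As a rough illustration of the infection step described above, the volume of viral stock required for a given MOI follows from the cell density and the culture volume. The sketch below is only indicative: the working volume (taken here as the full 2.5 liters) and the stock titer (1 x 10^8 pfu/ml) are assumptions not stated in the text, and the function name is hypothetical.

```python
def virus_stock_volume_ml(cell_density_per_ml, culture_volume_ml, moi, titer_pfu_per_ml):
    """Return the volume of viral stock (ml) needed to infect a culture at the given MOI."""
    total_cells = cell_density_per_ml * culture_volume_ml
    pfu_needed = moi * total_cells  # MOI = infectious particles per cell
    return pfu_needed / titer_pfu_per_ml


# From the text: 1 x 10^6 cells/ml and an MOI of 1.
# Assumed for illustration: 2500 ml working volume, stock titer of 1e8 pfu/ml.
volume = virus_stock_volume_ml(1e6, 2500, 1, 1e8)
print(f"Viral stock required: {volume:.1f} ml")
```

With a different titer from the plaque assay, only the last argument changes.
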

Example 2a: Preparation of the recombinant construct encoding
Magic Factor-1

HGF cDNA and the plasmid pRK7-Metron-F-1-His described above
were used as starting DNA. The fragment corresponding to LS-HL-K1-K2
of HGF was amplified by PCR using HGF cDNA as template and the
following oligonucleotides as primers:
P9 (sense)

5' CGCGCGGGATCCGCCAGCCCGTCCAGCAGGACCATG 3'
P10 (antisense)

5' CGCGCGAAGCTTTGTCAGCGCATGTTTTAATTGCAC 3'
in which the underlined palindromic sequences are the restriction sites for
the enzymes BamHI (oligonucleotide P9) and HindIII (oligonucleotide
P10). The PCR product was digested with the restriction enzymes BamHI
and HindIII and then purified by electrophoresis on agarose gel.
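
The position of the restriction sites in primers P9 and P10 can be checked with a few lines of code. The following is a minimal sketch, assuming the standard recognition sequences GGATCC (BamHI) and AAGCTT (HindIII); the helper names are hypothetical and serve only to illustrate the check.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

P9 = "CGCGCGGGATCCGCCAGCCCGTCCAGCAGGACCATG"   # sense primer, from the text
P10 = "CGCGCGAAGCTTTGTCAGCGCATGTTTTAATTGCAC"  # antisense primer, from the text

SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT"}  # standard recognition sequences


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def is_palindromic(site):
    """A restriction site is palindromic when it equals its reverse complement."""
    return site == reverse_complement(site)


def site_position(primer, enzyme):
    """Return the 0-based index of the enzyme site in the primer, or -1 if absent."""
    return primer.find(SITES[enzyme])


for name, primer, enzyme in (("P9", P9, "BamHI"), ("P10", P10, "HindIII")):
    print(name, enzyme, "site at index", site_position(primer, enzyme),
          "palindromic:", is_palindromic(SITES[enzyme]))
```

In both primers the site follows a six-base CGCGCG clamp at the 5' end, and P9 ends with the ATG start codon.
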

For the linker, the following partially complementary
oligonucleotides were synthesized, and subsequently annealed to obtain a
double-stranded DNA fragment with sticky ends:
P11 (sense)

5'AGCITCX3GGOGGirXKX]iGTK^^ 

P12 (antisense)

in which the underlined bases are the sequences compatible with the
restriction sites for the enzymes HindIII (oligonucleotide P11) and XbaI
(oligonucleotide P12). The fragment resulting from PCR and the
double-stranded linker sequence were inserted into the plasmid
pRK7-Metron-F-1-His in place of the EcoRI-XbaI fragment by means of an
EcoRI-BamHI adapter, to obtain the plasmid pRK7-Magic-F-1-His.

Example 2b: Production of Magic Factor-1

Magic Factor-1 is produced on a small scale by transient
transfection of BOSC cells as described for Metron
Factor-1. Semi-purification is performed by adsorption on Sepharose-A
beads conjugated with heparin followed by western blot analysis using
anti-poly-histidine antibodies (Fig. 5).

Example 3: Biological activity (scattering) on epithelial cells

The biological activity of recombinant HGF, NK2-HGF, Metron
Factor-1 and Magic Factor-1 was tested by a "scatter" assay on MDCK
epithelial cells. For this functional test, cells are plated at day 0 in 96-well
plates (10^ cells/well) in 90% DMEM - 10% bovine calf serum. At day 1
the medium is replaced with fresh medium buffered with 50 mM
HEPES pH 7.4 and the supernatant containing the recombinant protein is
added at different dilutions. At day 2 cells are washed with DPBS
(Dulbecco's Phosphate Buffered Saline), fixed in 11% glutaraldehyde,
stained with a Crystal-Violet solution and analysed by microscopy. The
scattering activity is evaluated by observing the morphology of the colonies,
which are clustered in the negative control (non-stimulated cells or
cells stimulated with supernatant containing no factors) whereas they are
dispersed in the positive control (HGF-His). The morphology of the cells
themselves also varies upon stimulation: as can be observed in
Fig. 6, cells stimulated with HGF-His and Metron Factor-1 have a more
oblong, spindle-shaped form, characterised by protrusions of the cell
membrane called pseudopodia. These morphological variations are the
consequence of factor-induced activation of a genetic program involving
the modification of a series of cellular parameters, such as digestion of
the cell matrix by specific proteases and an increase in motility.

The Table summarises the results of different tests, obtained with
factors HGF, NK2-HGF, Metron Factor-1 and Magic Factor-1 on MDCK
cells. The scattering units reported indicate the maximum dilution of the
conditioned supernatant containing the factor, at which motogenic activity
could be observed. Values are normalised for the protein content
determined by western blotting as described above (see Fig. 5). These data
indicate that the hybrid factors Metron Factor-1 and Magic Factor-1 have a
scattering activity approximately three orders of magnitude higher than
that of the NK2-HGF-His truncated form and about one order of magnitude
higher than that of the HGF-His parental factor.

                 HGF-His     NK2-His    Metron F-1     Magic Factor-1
Scatter units    900 ± 29    6 ± 5      5500 ± 1532    7600 ± 150

Table. Scattering activity of factors HGF-His, NK2-HGF-His, Metron
Factor-1 and Magic Factor-1 measured on kidney epithelial cells (MDCK). The
scattering units reported indicate the maximum dilution of the conditioned
supernatant containing the factor, at which a motogenic activity can be
observed. Values are normalised for the protein content determined by
western blotting.
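
The relative activities in the Table can be expressed as fold differences and as base-10 logarithms of the ratios. The sketch below uses only the mean scatter units reported in the Table (the ± terms are ignored); the function name is hypothetical.

```python
import math

# Mean scatter units from the Table (uncertainties omitted).
SCATTER_UNITS = {
    "HGF-His": 900,
    "NK2-His": 6,
    "Metron F-1": 5500,
    "Magic Factor-1": 7600,
}


def fold_change(factor, reference):
    """Ratio of the scatter units of a factor to those of a reference factor."""
    return SCATTER_UNITS[factor] / SCATTER_UNITS[reference]


for reference in ("NK2-His", "HGF-His"):
    for factor in ("Metron F-1", "Magic Factor-1"):
        ratio = fold_change(factor, reference)
        print(f"{factor} vs {reference}: {ratio:.1f}-fold, log10 = {math.log10(ratio):.2f}")
```

A log10 value near 3 corresponds to three orders of magnitude, and a value near 1 to one order of magnitude.
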

Example 4a: Test for the evaluation of protection against programmed
cell death (apoptosis)

One of the best characterised side effects of the chemotherapeutic
drug cisplatin is the induction of programmed cell death (apoptosis) of
epithelial cells of the proximal tubule, which leads to acute renal failure
(ARF). Thus, a factor that protects against cisplatin-induced cytotoxicity
is highly desirable. An in vitro functional test has been used, which allows
evaluation of the percentage of cisplatin-treated apoptotic cells in the
presence or in the absence of a survival factor. This system utilises a cell
line (LOG) derived from epithelial cells of human kidney proximal tubule,
immortalised by ectopic expression of SV40 large T antigen. For the
functional test, cells are plated at day 0 in 96-well plates (10^ cells/well) in
90% DMEM - 10% bovine calf serum. At day 1, the medium is replaced
with medium containing 0.5% bovine calf serum buffered with 50 mM
HEPES pH 7.4, to which different dilutions of the supernatant
containing the recombinant factor are added. Cells are pre-incubated with
these factors for 6 hours, and then further incubated in the presence of
10 ng/ml cisplatin. At day 2, cells are washed with DPBS and the percentage of
apoptotic cells is evaluated by the TUNEL technique (Boehringer
Mannheim). The same kind of tests can be performed using primary
cultures of human epithelial cells of kidney proximal tubule (PTEC).
These tests proved that Metron Factor-1 and Magic Factor-1 have
protecting activity against cisplatin-induced programmed cell death.

Example 4b: Protection against cisplatin-induced cytotoxicity by
transient gene delivery of Metron Factor-1 and Magic Factor-1

The protective effect of Metron F-1 and Magic F-1 against
cisplatin-induced cytotoxicity was further demonstrated by a transient
gene delivery approach. Simian kidney epithelial cells (COS) were
transfected with a control empty vector, an expression vector for Metron
F-1, or an expression vector for Magic F-1. Following transfection, cells
were treated for 16 hours with cisplatin (20 µg/ml) and the percentage of
surviving cells in each transfection was determined. Cisplatin treatment
was calibrated to cause the death of approximately 20% of the cells in the
negative control. Ectopic expression of Metron F-1 or Magic F-1 increased
the survival rate to about 92.3% and 94.0%, respectively.
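
The survival figures above can also be read as a relative reduction in cell death with respect to the negative control. The following sketch assumes a control survival of 80% (that is, the approximately 20% death stated in the text); the function name is hypothetical.

```python
def relative_death_reduction(control_survival_pct, treated_survival_pct):
    """Percentage reduction in cell death relative to the control."""
    control_death = 100.0 - control_survival_pct
    treated_death = 100.0 - treated_survival_pct
    return (control_death - treated_death) / control_death * 100.0


CONTROL_SURVIVAL = 80.0  # assumed from "approximately 20%" death in the control
for factor, survival in (("Metron F-1", 92.3), ("Magic F-1", 94.0)):
    reduction = relative_death_reduction(CONTROL_SURVIVAL, survival)
    print(f"{factor}: cell death reduced by about {reduction:.1f}%")
```
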

Example 5: Activation of the Met receptor by Metron Factor-1 and
Magic Factor-1

The ability of Metron Factor-1 and Magic Factor-1 to activate the
Met receptor was tested by analysing the ability of the recombinant factors
to induce tyrosine phosphorylation of Met in human epithelial cells
(A549). For this analysis, A549 cells at 90% confluence in a 100 mm petri
dish were stimulated for 10 minutes with 1 ml of conditioned supernatant
containing Metron Factor-1, Magic Factor-1 or no factor (as negative
control) diluted 1:2.5 or 1:10 in DMEM. After stimulation, cells were
washed on ice with PBS and lysed in 200 µl of lysis solution (1% Triton
X-100, 5 mM EDTA, 150 mM NaCl, 20 mM Tris-HCl pH 7.4) supplemented
with a cocktail of protease inhibitors. The lysates were immunoprecipitated
for 2 hours at 4° C with 10 µl of Sepharose-A beads covalently conjugated
with an anti-Met monoclonal antibody (Naldini, L. et al., 1991, EMBO J.
10: 2867-2878); the beads were washed 3 times in the same lysis solution
and heated at 90° C for 2 minutes to elute the adsorbed proteins. These
were separated by SDS-PAGE on an 8% bis-acrylamide gel, transferred onto
a membrane (Hybond-C; Amersham) and analysed by western blot. A mouse
monoclonal antibody against phosphotyrosine (UBI) diluted 1:10000 was
used as primary antibody and an anti-mouse IgG ovine antibody
conjugated with peroxidase (Amersham) was used as secondary antibody.
The secondary antibody was detected by ECL (Amersham) following the
protocol provided by the manufacturer. This analysis revealed that Metron
F-1 and Magic F-1 potently activate the Met receptor (Fig. 7).

Example 6: Protection against chemotherapy-induced renal failure by
Metron Factor-1 in vivo

Metron-F-1 was tested in a model of nephrotoxicity in Balb-c mice.
The method used was substantially as described (Kawaida K et al., 1994,
Hepatocyte growth factor prevents acute renal failure and accelerates renal
regeneration in mice, Proc. Natl. Acad. Sci. 91:4357-4361). Briefly, renal
failure was induced in male Balb-c mice weighing 20-25 g by an i.v.
injection of 7.5 mg/kg of HgCl2 (7 animals/group). Renal damage was
assessed by analysis of Blood Urea Nitrogen (BUN) and by histological
evaluation, 72 h after HgCl2 injection. Metron-F-1 was dissolved in 0.2 M
NaCl, containing 0.01% Tween 80 and 0.25% human serum albumin and
administered i.v. (100 ng/kg in a posological volume of 6.6 ml/kg) 0.5 h
before and 6, 12, 24, 36 and 48 h after HgCl2 injection. Control animals
were treated with the same amount of vehicle according to the same
scheme.
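
For orientation, the injection volume per animal and the administration schedule follow directly from the figures given above. The sketch below uses only the posological volume (6.6 ml/kg), the stated body weights and the stated time points; the names are hypothetical.

```python
DOSE_VOLUME_ML_PER_KG = 6.6
# Administration times in hours relative to the HgCl2 injection (t = 0).
DOSE_TIMES_H = [-0.5, 6, 12, 24, 36, 48]


def injection_volume_ml(body_weight_g):
    """Injection volume (ml) for one administration at 6.6 ml/kg."""
    return DOSE_VOLUME_ML_PER_KG * body_weight_g / 1000.0


for weight in (20, 25):
    print(f"{weight} g mouse: {injection_volume_ml(weight):.3f} ml per injection, "
          f"{len(DOSE_TIMES_H)} injections in total")
```
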

Metron-F-1 significantly prevented the onset of acute renal failure
induced by HgCl2, evaluated in terms of BUN (figure 8). BUN values
were closely paralleled by the histological findings, scored by an
independent investigator.

In the following sequence listing: 
SEQ. ID. NO. 1: Magic F-1 DNA coding sequence; 
SEQ. ID. NO. 2: Magic F-1 amino acid sequence;
SEQ. ID. NO. 3: Metron F-1 DNA coding sequence; 
SEQ. ID. NO. 4: Metron F-1 amino acid sequence. 

CLAIMS

1. Recombinant proteins comprising two superdomains, separated by a
spacer sequence (linker), obtained by combining the HL and K1-K4 domains
of the HGF and MSP α chains.

2. Recombinant proteins as claimed in claim 1, of general formula (I):

[A] - B - [C] - (D)y (I)

in which

[A] corresponds to the sequence (LS)m-HL-K1-(K2)n-(K3)o-(K4)p
wherein (the numbering of the following amino acids refers to the HGF
and MSP sequences as reported in Fig. 1 and 2, respectively):
LS is an amino acid sequence corresponding to residues 1-31 of HGF or
1-18 of MSP;

HL is an amino acid sequence derived from the α chain of HGF starting
between residues 32-70 and ending between residues 96-127; or it is an
amino acid sequence derived from the α chain of MSP starting between
residues 19-56 and ending between residues 78-109;

K1 is an amino acid sequence derived from the α chain of HGF starting
between residues 97-128 and ending between residues 201-205; or it is an
amino acid sequence derived from the α chain of MSP starting between
residues 79-110 and ending between residues 186-190;

K2 is an amino acid sequence derived from the α chain of HGF starting
between residues 202-206 and ending between residues 283-299; or it is
an amino acid sequence derived from the α chain of MSP starting between
residues 187-191 and ending between residues 268-282;

K3 is an amino acid sequence derived from the α chain of HGF starting
between residues 284-300 and ending between residues 378-385; or it is
an amino acid sequence derived from the α chain of MSP starting between
residues 269-283 and ending between residues 361-369;

K4 is an amino acid sequence derived from the α chain of HGF starting
between residues 379-386 and ending between residues 464-487; or it is
an amino acid sequence derived from the α chain of MSP starting between
residues 362-370 and ending between residues 448-481;

m, n, o, p are 0 or 1;
the sum n + o + p is an integer from 1 to 3 or 0, with the proviso that
n ≥ o ≥ p;

B is the sequence [(X)q Y]r, wherein X = Gly and Y = Ser, or Cys, or Met,
or Ala;

q is an integer from 2 to 8;
r is an integer from 1 to 9;

[C] corresponds to the sequence HL-K1-(K2)s-(K3)t-(K4)u
wherein HL, K1-K4 are as defined above,

s, t, u are 0 or 1; the sum s + t + u is an integer from 1 to 3 or 0, with the
proviso that s ≥ t ≥ u;

D is the sequence W-Z, wherein W is a conventional proteolytic site, Z is
any tag sequence useful for the purification and detection of the protein; y
is 0 or 1.
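
The combinations permitted by general formula (I) can be enumerated programmatically. The following is a minimal sketch, assuming that the proviso on the kringle indices is read as a non-strict ordering (n ≥ o ≥ p), since three indices restricted to 0 or 1 cannot satisfy a strict one; the function names are hypothetical and one-letter amino acid codes are used for the linker.

```python
from itertools import product


def kringle_options():
    """All (n, o, p) in {0, 1}^3 satisfying n >= o >= p (assumed reading of the proviso)."""
    return [c for c in product((0, 1), repeat=3) if c[0] >= c[1] >= c[2]]


def superdomain(flags, leader=False):
    """Domain string for one superdomain, e.g. LS-HL-K1-K2 for flags (1, 0, 0)."""
    parts = (["LS"] if leader else []) + ["HL", "K1"]
    parts += [name for name, present in zip(("K2", "K3", "K4"), flags) if present]
    return "-".join(parts)


def linker(q, r, y="S"):
    """Linker B = [(Gly)q Y]r with q in 2..8, r in 1..9 and Y in Ser, Cys, Met, Ala."""
    if not 2 <= q <= 8 or not 1 <= r <= 9 or y not in ("S", "C", "M", "A"):
        raise ValueError("parameters outside the ranges given in the claim")
    return ("G" * q + y) * r


for flags in kringle_options():
    print(flags, superdomain(flags, leader=True))
print("(Gly4Ser)3 linker:", linker(4, 3))
```

Under this reading a kringle can be present only if all lower-numbered kringles are present, which gives four possible arrangements for each superdomain.
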

3. Recombinant proteins according to claims 1-2, in which the HL
domain is a sequence of the HGF α chain ranging from amino acids 32 to 127,
or a sequence of the MSP α chain ranging from amino acids 19 to 98; the K1
domain is a sequence of the HGF α chain ranging from amino acids 128 to
203, or a sequence of the MSP α chain ranging from amino acids 99 to 188;
the K2 domain is a sequence of the HGF α chain ranging from amino acids
204 to 294, or a sequence of the MSP α chain ranging from amino acids 189
to 274; the K3 domain is a sequence of the HGF α chain ranging from amino
acids 286 to 383, or a sequence of the MSP α chain ranging from amino acids
275 to 367; the K4 domain is a sequence of the HGF α chain ranging from
amino acids 384 to 487, or a sequence of the MSP α chain ranging from
amino acids 368 to 477.

4. Recombinant proteins according to claims 1-3 of formula (II):
LSmsp-HLmsp-K1msp-K2msp-L-HLhgf-K1hgf-K2hgf-D (II)
in which LSmsp is the sequence 1-18 of MSP, HLmsp is the sequence 19-56
of MSP, K1msp is the sequence 99-188 of MSP, K2msp is the sequence
189-274 of MSP, HLhgf is the sequence 32-127 of HGF, K1hgf is the
sequence 128-203 of HGF, K2hgf is the sequence 204-294 of HGF, L is
the sequence (Gly4Ser)3, D is the sequence Asp4-Lys-His6.

5. Recombinant proteins according to claims 1-3 of formula (III):
LShgf-HLhgf-K1hgf-K2hgf-L-HLhgf-K1hgf-K2hgf-D (III)

in which HLhgf, K1hgf, K2hgf, L and D are as defined in claim 4, LShgf
is the sequence 1-31 of HGF.

6. Nucleotide sequences encoding the recombinant proteins of
claims 1-5.

7. Expression vectors comprising the nucleotide sequences of claim 6.

8. Prokaryotic or eukaryotic host cell transformed with the expression
vector of claim 7.

9. Process for preparing the recombinant proteins of claims 1-5, which
comprises the following steps:

a) construction of DNA encoding the desired protein;

b) insertion of the DNA in an expression vector;

c) transformation of a host cell with the recombinant DNA (rDNA);

d) culture of the transformed host cell so as to express the recombinant
protein;

e) extraction and purification of the produced recombinant protein.

10. Process according to claim 9, wherein the host cell is a kidney
epithelial BOSC cell or an Sf9 insect cell.

11. Recombinant proteins of claims 1-5 for use as therapeutic agents.

12. Use of recombinant proteins of claims 1-5 in the manufacture of a
medicament for the prevention or treatment of chemotherapeutic-induced
toxicity.

13. Use according to claim 10, wherein the chemotherapeutic-induced
toxicity is myelotoxicity, kidney toxicity, neurotoxicity, mucotoxicity and
hepatotoxicity.

14. Pharmaceutical compositions containing an effective amount of the
recombinant proteins of claims 1-5, in combination with
pharmacologically acceptable excipients.

FIG 1a

ATGTGGGTGACCAAACTCCTGCCAGCCCTGCTGCTGCAGCATGTCCTCCTGCATCTCCTC
1 + + + + + + 60
TACACCCACTGGTTTGAGGACGGTCGGGACGACGACGTCGTACAGGAGGACGTAGAGGAG
1 MWVTKLLPALLLQHVLLHLL 20

CTGCTCCCCATCGCCATCCCCTATGCAGAGGGACAAAGGAAAAGAAGAAATACAATTCAT
GACGAGGGGTAGCGGTAGGGGATACGTCTCCCTGTTTCCTTTTCTTCTTTATGTTAAGTA
21 LLPIAIPYAEGQRKRRNTIH 40



GAATTCAAAAAATCAGCAAAGACTACCCTAATCAAAATAGATCCAGCACTGAAGATAAAA
121 + + + + + + 180

CTTAAGTTTTTTAGTCGTTTCTGATGGGATTAGTTTTATCTAGGTCGTGACTTCTATTTT 
41EFKKSAKTTLIKIDPALKIK 60 



ACCAAAAAAGTGAATACTGCAGACCAATGTGCTAATAGATGTACTAGGAATAAAGGACTT 
181 + + + + + + 240

TGGTTTTTTCACTTATGACGTCTGGTTACACGATTATCTACATGATCCTTATTTCCTGAA 
61TKKVNTADQCANRCTRNKGL 80 



CCATTCACTTGCAAGGCTTTTGTTTTTGATAAAGCAAGAAAACAATGCCTCTGGTTCCCC 
241 + + + + + + 300 

GGTAAGTGAACGTTCCGAAAACAAAAACTATTTCGTTCTTTTGTTACGGAGACCAAGGGG 
81PFTCKAFVFDKARKQCLWFP 100 



TTCAATAGCATGTCAAGTGGAGTGAAAAAAGAATTTGGCCATGAATTTGACCTCTATGAA 
301 + + + + + + 360

AAGTTATCGTAGAGTTCACCTCACTTTTTTCTTAAACCGGTACTTAAACTGGAGATACTT 
101 FNSMSSGVKKEFGHEFDLYE 120 



AACAAAGACTACATTAGAAACTGCATCATTGGTAAAGGACGCAGCTACAAGGGAACAGTA 
361 + + + + + + 420 

TTGTTTCTGATGTAATCTTTGACGTAGTAACCATTTCCTGCGTCGATGTTCCCTTGTCAT 
121NKDYIRNCIIGKGRSYKGTV 140 



TCTATCACTAAGAGTGGCATCAAATGTCAGCCCTGGAGTTCCATGATACCACACGAACAC 
421 + + + + + + 480 

AGATAGTGATTCTCACCGTAGTTTACAGTCGGGACCTCAAGGTACTATGGTGTGCTTGTG 
141SITKSGIKCQPWSSMIPHEH 160 
TCGATAGCCCCATTTCTGGATGTCCTTTTGATGACAGCTTTAGGAGCTCCCCTTCTTCCC
161SYRGKDLQENYCRNPRGEEG 



661 ^E^E^^E^JfE^E^IZZJ^^^^^^^^^^^ATCATCAGACACCACACCGGCACAAATTC 

TGTCTTAGTCCGTTCTAAACAGTCGCGACCCTAGTAGTCTGTGGTGTGGC^ 
221 TESGKICQRWDHQTPHRHKF 



781 E^EE5™5J E^Zf EEEEZ+^^^ZE^I^^^^^^^^^^^^^^^^^^^'^^^^^T'^^^''^'^ 

GTCGGCTCCGGTACCACGATATGAGAACTGGGAGTGTGGGCGACCCTCATGACAC^ 
261QPRPWCYTLDPHTRWEYCAI 

841 ^^E^EEEJEZE^E^E^JE^^^^^^^^A^'^^^TGTTCCTTTGGAAACAACTGAATGC 

TTTTGTACGCGACTGTTATGATACTTACTGTGACTACAAGGAAACCTTTGTTG^^ 
281 KTCADNTMNDTDVPLETTEC 



961 EEIE^EEEIIEEE^EIEZ^+^Z^^^^^^^^^^^^^'^^^^^^^A^'^^cT'^^^^^TTTCAAG 

ACAGTCGCAACCCTAAGAGTCATAGGAGTGCTCGTACTGTACTGAGGACTTTTAAAGTTC 



540 
180 



541 EE^EEEZ EE+EIIIE^E^-SE^IEE^E^*^^^^^^^'^-^^^^^^^'^^'^^^^^'^'^^'^^^^*^ 

CCTGGGACCACAAAGTGTTCGTTAGGTCTCCATGCGATGCTTCAGACACTGTAAGGAGTC 
181GPWCFTSNPEVRYEVCDIPQ 200 



660 



601 IEEIE?E^JZIE^EEE-+E^E^IE'^^W?-^^^A<^T™<=GAGGTCTCATGGATCAT 

ACAAGTCTTCAACTTACGTACTGGACGTTACCCCTCTCAATAGCTCCAGAGTA 
201CSEVECMTCNGESYRGLMDH 220 



720 
240 



780 



721 EZEEEEE^^EH^ZEEEE^E^EEEE'^P^^^^^'^^'^^^^'^^^'^^^^^^^^^^T^Gc 

AACGGACTTTCTATAGGGCTGTTCCCGAAACTACTATTAATAACGGCGTTAGGGCTA^ 
241LPERYPDKGFDDNYCRNPDG 260 



840 
280 

900 
300 



ATCCAAGGTCAAGGAGAAGGCTACAGGGGCACTGTCAATACCATTTGGAATGGAATTCCA 
">-'■'■ I 1 1 — — ^ i. 960 

TAGGTTCCAGTTCCTCTTCCGATGTCCCCGTGACAGTTATGGTAAACCTTACCTTAAGGT 
301IQGQGEG YRGTVNTIWNGIP 320 



1020 

321 CQRWDSQYPHEHDMTPENFK 340 



021 lEE^EE^EJZ^EE^E^^^^^'^^EI^^^^^^^^^^^^'^"^^^'^^"^^^^*^^^^^'^ 

ACGTTCCTGGATGCTCTTTTAATGACGGCTTTAGGTCTACCCAGACTTAGTGGGACCACA ^ ° ^ ' 
341 CKDLRENYCRNPDGSESPWC 360 



1081 lE'^^^-E^^'^+'^'^^^^^E^^EE^^^*^'^^^^''^^^^^^'^^^^^^'^'^^^^^^'^^'^ 

AAATGGTGACTAGGTTTGTAGGCTCAACCGATGACGAGGGTTTAAGGTTTGACACTATAC ^ ^ ° 
361 FTTDPNIRVGYCSQIPNCDM 380 
TCACATGGACAAGATTGTTATCGTGGGAATGGCAAAAATTATATGGGCAACTTATCCCAA

1141 + + + + + -r 1200 

AGTGTACCTGTTCTAACAATAGCACCCTTACCGTTTTTAATATACCCGTTGAATAGGGTT 

381 SHGQDCYRGNGKNYMGNLSQ 400 

ACAAGATCTGGACTAACATGTTCAATGTGGGACAAGAACATGGAAGACTTACATCGTCAT 

1201 + + + + + + 1260 

TGTTCTAGACCTGATTGTACAAGTTACACCCTGTTCTTGTACCTTCTGAATGTAGCAGTA 

401 TRSGLTCSMWDKNMEDLHRH 420 

^ ^ ATCTTCTGGGAACCAGATGCAAGTAAGCTGAATGAGAATTACTGCCGAAATCCAGATGAT , 
TAGAAGACCCTTGGTCTACGTTCATTCGACTTACTCTTAATGACGGCTTTAGGTCTACTA 
421 I FWE PDAS KLNE NYC RN P D D 440 

^^^^ GACGCTCATGGACCCTGGTGCTACACGGGAAATCCACTCATTCCTTGGGATTATTGCCCT 
CTGCGAGTACCTGGGACCACGATGTGCCCTTTAGGTGAGTAAGGAACCCTAATAACGGGA 
441 DAHGPWCYTGNPLIPWDYCP 460 

^^^^ ATTTCTCGTTGTGAAGGTGATACCACACCTACAATAGTCAATTTAGACCATCCCGTAATA 
TAAAGAGCAACACTTCCACTATGGTGTGGATGTTATCAGTTAAATCTGGTAGGGCATTAT 
461 ISRCEGDTTPTIVNLDHPVI 480 

TCTTGTGCCAAAACGAAACAATTGCGAGTTGTAAATGGGATTCCAACACGAACAAACATA 

1441 + + + + + + 1500 

AGAACACGGTTTTGCTTTGTTAACGCTCAACATTTACCCTAAGGTTGTGCTTGTTTGTAT 

481 SCAKTK.QLRVVNGI PTRTNI 500 

^^^^ GGATGGATGGTTAGTTTGAGATACAGAAATAAACATATCTGCGGAGGATCATTGATAAAG 
CCTACCTACCAATCAAACTCTATGTCTTTATTTGTATAGACGCCTCCTAGTAACTATTTC 
501 GWMVSLRYRNKHICGGSLIK 520 

GAGAGTTGGGTTCTTACTGCACGACAGTGTTTCCCTTCTCGAGACTTGAAAGATTATGAA 

1561 + + + + + + 1620 

CTCTCAACCCAAGAATGACGTGCTGTCACAAAGGGAAGAGCTCTGAACTTTCTAATACTT 

521 ESWVLTARQCFPSRDLKDYE 540 

GCTTGGCTTGGAATTCATGATGTCCACGGAAGAGGAGATGAGAAATGCAAACAGGTTCTC 

1621 + + + + + + 1680 

CGAACCGAACCTTAAGTACTACAGGTGCCTTCTCCTCTACTCTTTACGTTTGTCCAAGAG 

541 AWLGIHDVHGRGDEKCKQVL 560 

AATGTTTCCCAGCTGGTATATGGCCCTGAAGGATCAGATCTGGTTTTAATGAAGCTTGCC 

1681 + + + + + + 1740 

TTACAAAGGGTCGACCATATACCGGGACTTCCTAGTCTAGACCAAAATTACTTCGAACGG 

561 NVSQLVYGPEGSDLVLMKLA 580 

AGGCCTGCTGTCCTGGATGATTTTGTTAGTACGATTGATTTACCTAATTATGGATGCACA 
TCCGGACGACAGGACCTACTAAAACAATCATGCTAACTAAATGGATTAATACCTACGTGT 
581 RPAVLDDFVST I DLPNYGCT 600 
(continued)

ATTCCTGAAAAGACCAGTTGCAGTGTTTATGGCTGGGGCTACACTGGATTGATCAACTAT 

■igoi H + H + "* lOOU 

TAAGGACTTTTCTGGTCAACGTCACAAATACCGACCCCGATGTGACCTAACTAGTTGATA 



601 I 



PEKTSCSVYGWGYTGLINY 620 



GATGGCCTATTACGAGTGGCACATCTCTATATAATGGGAAATGAGAAATGCAGCCAGCAT 
1861 ^ + ^ ^^"^^ 



621DGLL R.VAHLYIM GNE KCSQH 640 

CATCGAGGGAAGGTGACTCTGAATGAGTCTGAAATATGTGCTGGGGCTGAAAAG^ ^^^^ 
^ GTAGCTCCCTTCCACTGAGACTTACTCAGACTTTATACACGACC 
641HRGKVTLNESEICAGAEKIG 660 

TCAGGACCATGTGAGGGGGATTATGGTGGCCCACpGpTGTGAGCAACATAAAAT^ ^ 

■"■^^ ^ AGTCCTGGTACACTCCCCCTAATACCACCGGGTGAA^ 
661SGPCEGDYGGPLVCEQHKMR 680 

ATGGTTCTTGGTGTCATTGTTCCTGGTCGTGGATGTGCCATTCCAAATCGTCCTGGTAT^ ^^^^ 
^ ^ ^ ^ ACCAAGAACCACAGTAACAAGGACCAGCACCTACACGGTAAGG 
681MVLGVIVPGRGCAIPNRPGI 700 

TTTGTCCGAGTAGCATATTATGCAAAATGGATACACAAAATTATTTTAACATAT^^ 2160 
^ ° AAACAGGCTCATCGTATAATACGTTTTACCT^^ 



701 F V 



RVAYYAKWIHKIILTYKV 720 



CCACAGTCATAG

2161 T= 2172 

GGTGTCAGTATC 

721 P Q S * 723 
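
The layout of the figure (upper strand, complementary strand, translated residues) lends itself to a simple consistency check. The sketch below applies it to the final row of FIG 1a only; the codon table is deliberately limited to the four codons that occur in that row, and the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Only the codons present in the final row of the figure (standard genetic code).
CODONS = {"CCA": "P", "CAG": "Q", "TCA": "S", "TAG": "*"}

UPPER = "CCACAGTCATAG"  # positions 2161-2172, upper strand
LOWER = "GGTGTCAGTATC"  # complementary strand as printed below it


def complement(seq):
    """Base-by-base complement, in the same left-to-right order as printed."""
    return seq.translate(COMPLEMENT)


def translate(seq):
    """Translate complete codons of the upper strand using the partial table above."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))


print("strands complementary:", complement(UPPER) == LOWER)
print("translation:", translate(UPPER))
```

The same check could be extended to the other rows with a complete codon table.
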
FIG 1b



ATGGGGTGGCTCCCACTCCTGCTGCTTCTGACTCAATGCTTAGGGGTCCCTGGGCAGCGC 
^ TACCCCACCGAGGGTGAGGACGACGAAGACTGAGTTACGAATCCCCAGGGACCCGTCGCG 
1 MGWLPLLLLLTQCLGVPGQR 20

TCGCCATTGAATGACTTCCAAGTGCTCCGGGGCACAGAGCTACAGCACCTGCTACATGCG 

_■ _^ — -1. — — _ — _ ■ ^ — ■ — — — — — ——— — — — — — ——— + X20 

, AGCGGTAACTTACTGAAGGTTCACGAGGCCCCGTGTCTCGATGTCGTGGACGATGTACGC. 

21SPLNDFQVLRGTELQHLLHA 40 

GTGGTGCCCGGGCCTTGGCAGGAGGATGTGGCAGATGCTGAAGAGTGTGCTGGT^ ^ ^ ^ 

^'^'^ CACCACGGGCCCGGAACCGTCCTCCTACACCGTCTACGACTTCTCACACGACCAGCGACA 
41VVPGPWQEDVADAEECAGRC 60 

GGGCCCTTAATGGACTGCCGGGCCTTCCACTACAACGTGAGCAGCCATGGTTGCCAACTG 

igl |. I h — — 1 — +— — — ■ — H 240 

CCCGGGAATTACCTGACGGCCCGGAAGGTGATGTTGCACTCGTCGGTACCAACGGTTGAC 

61GPLMDCRAFHYNVSSHGCQL 80 



CTGCCATGGACTCAACACTCGCCCCACACGAGGCTGCGGCGTTCTGGGCGCTGTGACCTC 
' ^ ^ GACGGTACCTGAGTTGTGAGCGGGGTGTGCTCCGACGCCGCAAGACCCGCGACACTGGAG 
81LPWTQHSPHTRLRRSGRCDL 100 



TTCCAGAAGAAAGACTACGTACGGACCTGCATCATGAACAATGGGGTTGGGTACCGGGGC 

301 + + + + + + 360

AAGGTCTTCTTTCTGATGCATGCCTGGACGTAGTACTTGTTACCCCAACCCATGGCCCCG 

101 FQKKDYVRTCIMNNGVGYRG 120 

ACCATGGCCACGACCGTGGGTGGCCTGCCCTGCCAGGCpGGAGCCACAAGTTCCCGAAT ^ ^ ^ 
■^^■^ TGGTACCGGTGCTGGCACCCACCGGACGGGACGGTCCGAACCTCGGTGTTCAAGGGCTTA 
121TMATTVGGLPCQAWSHKFPN 140 

GATCACAAGTACACGCCCACTCTCCGGAATGGCCTGGAAGAGAACTTCTGCCGTAACCCT 

421 H + + + "** 

CTAGTGTTCATGTGCGGGTGAGAGGCCTTACCGGACCTTCTCTTGAAGACGGCATTGGGA 

141 DHKYTPTLRNGLEENFCRNP 160 

(continued) 
481 GATGGCGACCCCGGAGGTCCTTGGTGCTACACAACAGACCCTGCTGTGCGCTTCCAGAGC

CTACCGCTGGGGCCTCCAGGAACCACGATGTGTTGTCTGGGACGACACGCGAAGGTCTCG ^ ^ ° 
161 DGDPGGPWCYTTDPAVRFQS 180 

TGCGGCATCAAATCCTGCCGGGAGGCCGCGTGTGTCTGGTGCAATGGCGAGGAATACCGC 



531 

ACGCCGTAGTTTAGGACGGCCCTCCGGCGCACACAGACCACGTTACCGCTCCTTATGGCG 



600 

181 CGIKSCREAACVWCNGEEYR 200 



GGCGCGGTAGACCGCACGGAGTCAGGGCGCGAGTGCCAGCGCTGGGATCTTCAGCACCCG 
CCGCGCCATCTGGCGTGCCTCAGTCCCGGG<iTCACGGTCGCGACCCTAGAAGTCGTGGGC 
201 GAVDRTESGRECQRWDLQHP 220 

^ CACCAGCACCCCTTCGAGCCGGGCAAGTTCCTCGACCAAGGTCTGGACGACAACTATTGC 

GTGGTCGTGGGGAAGCTCGGCCCGTTCAAGGAGCTGGTTCCAGACCTGCTGTTGATAACG ''^^ 
221 HQHPFE PGKFLDQGLDDNYC 240 

CGGAATCCTGACGGCTCCGAGCGGCCATGGTGCTACACTACGGATCCGCAGATCGAGCGA 
GCCTTAGGACTGCCGAGGCTCGCCGGTACCACGATGTGATGCCTAGGCGTCTAGCTCGCT 
241 RNPDGSERPWCYTTDPQIER 260 

_^ ^ ^ GAGTTCTGTGACCTCCCCCGCTGCGGGTCCGAGGCACAGCCCCGCCAAGAGGCCACAACT 
CTCAAGACACTGGAGGGGGCGACGCCCAGGCTCCGTGTCGGGGCGGTTCTCCGGTGTTGA 
261 EFC DLP RCGS EA QPRQEATT 280 

^ ^ ^ GTCAGCTGCTTCCGCGGGAAGGGTGAGGGCTACCGGGGCACAGCCAATACCACCACTGCG 
CAGTCGACGAAGGCGCCCTTCCCACTCCCGATGGCCCCGTGTCGGTTATGGTGGTGACGC 
281 VSCFRGKGEGYRGTANTTTA 300 

GGCGTACCTTGCCAGCGTTGGGACGCGCAAATCCCGCATCAGCACCGATTTACGCCAGAA 

901 + + + + + + 960 

CCGCATGGAACGGTCGCAACCCTGCGCGTTTAGGGCGTAGTCGTGGCTAAATGCGGTCTT 

301 GV P CQRW DAQ I PHQH RFT PE 320 

AAATACGCGTGCAAAGACCTTCGGGAGAACTTCTGCCGGAACCCCGACGGCTCAGAGGCG 

961 + + + + + + 1020 

TTTATGCGCACGTTTCTGGAAGCCCTCTTGAAGACGGCCTTGGGGCTGCCGAGTCTCCGC 

321 KYACKDLRENFCRNPDGSEA 340 

^ CCCTGGTGCTTCACACTGCGGCCCGGCATGCGCGCGGCCTTTTGCTACCAGATCCGGCGT 
GGGACCACGAAGTGTGACGCCGGGCCGTACGCGCGCCGGAAAACGATGGTCTAGGCCGCA 
341 PWCFTLRPGMRAAFCYQIRR 360 

TGTACAGACGACGTGCGGCCCCAGGACTGCTACCACGGCGCAGGGGAGCAGTACCGCGGC 
ACATGTCTGCTGCACGCCGGGGTCCTGACGATGGTGCCGCGTCCCCTCGTCATGGCGCCG 
361 CTDDVRPQDCYHGAGEQYRG 380 
ACGGTCAGCAAGACCCGCAAGGGTGTCCAGTGCCAGCGCTGGTCCGCTGAGACGCCGCAC

1141 + + + + + + 1200 

TGCCAGTCGTTCTGGGCGTTCCCACAGGTCACGGTCGCGACCAGGCGACTCTGCGGCGTG 

381 TVSKTRKGVQCQRWSAETPH 400 

AAGCCGCAGTTCACGTTTACCTCCGAACCGCATGCACAACTGGAGGAGAACTTCTGCCGG 
1201 + + + + + + 1260 

ttcggcgtcaagtgcaaatggaggcttggcgtacgtgttgacctcctcttgaagacggcc 
401 kpqftftsephaqleenfcr 420 

aacccagatggggatagccatgggccctggtgctacacgatggacccaaggaccccattc 
ttgggtctacccctatcggtacccgg'gaccacgatgtgctacctgggttcctggggtaag 
421 npdgdshgpwcytmdprtpf 440 

gactactgtgccctgcgacgctgcgctgatgaccagccgccatcaatcctggacccccca 

1321 + + + + + + 1380 

ctgatgacacgggacgctgcgacgcgactactggtcggcggtagttaggacctggggggt 
441 dycal rrcaddqppsi ldpp 460 

gaccaggtgcagtttgagaagtgtggcaagagggtggatcggctggatcagcggcgttcc 
ctggtccacgtcaaactcttcacaccgttctcccacctagccgacctagtcgccgcaagg 
461 dqvqfekcgkrvdrldqrrs 480 

aagctgcgcgtggttgggggccatccgggcaactcaccctggacagtcagcttgcggaat 
ttcgacgcgcaccaacccccggtaggcccgttgagtgggacctgtcagtcgaacgcctta 
481 klrvvgghpgnspwtvslrn 500

cggcagggccagcatttctgcggggggtctctagtgaaggagcagtggatactgactgcc
gccgtcccggtcgtaaagacgccccccagagatcacttcctcgtcacctatgactgacgg
501 rqgqhfcggslvkeqwilta 520

cggcagtgcttctcctcctgccatatgcctctcacgggctatgaggtatggttgggcacc
gccgtcacgaagaggaggacggtatacggagagtgcccgatactccataccaacccgtgg
521 rqcfsschmpltgyevwlgt 540

ctgttccagaacccacagcatggagagccaagcctacagcgggtcccagtagccaagatg 

1621 + + + + + + 1680 

GACAAGGTCTTGGGTGTCGTACCTCTCGGTTCGGATGTCGCCCAGGGTCATCGGTTCTAC 

541 LFQNPQHGEPSLQRVPVAKM 560 

GTGTGTGGGCCCTCAGGCTCCCAGCTTGTCCTGCTCAAGCTGGAGAGATCTGTGACCCTG
CACACACCCGGGAGTCCGAGGGTCGAACAGGACGAGTTCGACCTCTCTAGACACTGGGAC
561 VCGPSGSQLVLLKLERSVTL 580

AACCAGCGTGTGGCCCTGATCTGCCTGCCCCCTGAATGGTATGTGGTGCCTCCAGGGACC
TTGGTCGCACACCGGGACTAGACGGACGGGGGACTTACCATACACCACGGAGGTCCCTGG 
581 NQRVALICLPPEWYVVPPGT 600 
(continued)



AAGTGTGAGATTGCAGGCTGGGGTGAGACCAAAGGTACGGGTAATGACACAGTCCTAAAT 

1801 + + + + + + 1860 

TTCACACTCTAACGTCCGACCCCACTCTGGTTTCCATGCCCATTACTGTGTCAGGATTTA 

601 KCEIAGWGETKGTGNDTVLN 620



GTGGCCTTTCTGAATGTTATCTCCAACCAGGAGTGTAACATCAAGCACCGAGGACGTGTG 

1861 + + + + + + 1920 

CACCGGAAAGACTTACAATAGAGGTTGGTCCTCACATTGTAGTTCGTGGCTCCTGCACAC

621 VAFLNVISNQECNIKHRGRV 640



CGGGAGAGTGAGATGTGCACTGAGGGACTGTTGGCCCCTGTGGGGGCCTGTGAGGGTGAC
GCCCTCTCACTCTACACGTGACTCCCTGACAACCGGGGACACCCCCGGACACTCCCACTG
641 RESEMCTEGLLAPVGACEGD 660

TACGGGGGCCCACTTGCCTGCTTTACCCACAACTGCTGGGTCCTGGAAGGAATTATAATC
ATGCCCCCGGGTGAACGGACGAAATGGGTGTTGACGACCCAGGACCTTCCTTAATATTAG
661 YGGPLACFTHNCWVLEGIII 680

CCCAACCGAGTATGCGCAAGGTCCCGCTGGCCAGCTGTCTTCACGCGTGTCTCTGTGTTT
GGGTTGGCTCATACGCGTTCCAGGGCGACCGGTCGACAGAAGTGCGCACAGAGACACAAA
681 PNRVCARSRWPAVFTRVSVF 700



GTGGACTGGATTCACAAGGTCATGAGACTGGGTTAG 

2101 + + + 2136

CACCTGACCTAAGTGTTCCAGTACTCTGACCCAATC 

701 VDWIHKVMRLG* 711 
[Schematic of the construct. Labels: Leader sequence, HL, K1, K2, Linker, Tag; domain sources: MSP, HGF]
FIG 2b

GAATTCCACCATGGGGTGGCTCCCACTCCTGCTGCTTCTGACTCAATGCTTAGGGGTCCC 

1 + + + + + + 60

CTTAAGGTGGTACCCCACCGAGGGTGAGGACGACGAAGACTGAGTTACGAATCCCCAGGG

1 MGWLPLLLLLTQCLGVP 17



TGGGCAGCGCTCGCCATTGAATGACTTCCAAGTGCTCCGGGGCACAGAGCTACAGCACCT 

61 + + + + + + 120

ACCCGTCGCGAGCGGTAACTTACTGAAGGTTCACGAGGCCCCGTGTCTCGATGTCGTGGA
18 GQRSPLNDFQVLRGTELQHL 37

GCTACATGCGGTGGTGCCCGGGCCTTGGCAGGAGGATGTGGCAGATGCTGAAGAGTGTGC
CGATGTACGCCACCACGGGCCCGGAACCGTCCTCCTACACCGTCTACGACTTCTCACACG
38 LHAVVPGPWQEDVADAEECA 57 

TGGTCGCTGTGGGCCCTTAATGGACTGCCGGGCCTTCCACTACAACGTGAGCAGCCATGG 
ACCAGCGACACCCGGGAATTACCTGACGGCCCGGAAGGTGATGTTGCACTCGTCGGTACC 
58 GRCGPLMDCRAFHYNVSSHG 77 

TTGCCAACTGCTGCCATGGACTCAACACTCGCCCCACACGAGGCTGCGGCGTTCTGGGCG 
AACGGTTGACGACGGTACCTGAGTTGTGAGCGGGGTGTGCTCCGACGCCGCAAGACCCGC 
78 CQLLPWTQHSPHTRLRRSGR 97

CTGTGACCTCTTCCAGAAGAAAGACTACGTACGGACCTGCATCATGAACAATGGGGTTGG 

301 + + + + + + 360

GACACTGGAGAAGGTCTTCTTTCTGATGCATGCCTGGACGTAGTACTTGTTACCCCAACC 

98 CDLFQKKDYVRTCIMNNGVG 117 

GTACCGGGGCACCATGGCCACGACCGTGGGTGGCCTGCCCTGCCAGGCTTGGAGCCACAA
CATGGCCCCGTGGTACCGGTGCTGGCACCCACCGGACGGGACGGTCCGAACCTCGGTGTT

118 YRGTMATTVGGLPCQAWSHK 137

GTTCCCGAATGATCACAAGTACACGCCCACTCTCCGGAATGGCCTGGAAGAGAACTTCTG

421 + + + + + + 480

CAAGGGCTTACTAGTGTTCATGTGCGGGTGAGAGGCCTTACCGGACCTTCTCTTGAAGAC

138 FPNDHKYTPTLRNGLEENFC 157

CCGTAACCCTGATGGCGACCCCGGAGGTCCTTGGTGCTACACAACAGACCCTGCTGTGCG

481 + + + + + + 540

GGCATTGGGACTACCGCTGGGGCCTCCAGGAACCACGATGTGTTGTCTGGGACGACACGC

158 RNPDGDPGGPWCYTTDPAVR 177

(continued) 
CTTCCAGAGCTGCGGCATCAAATCCTGCCGGGAGGCCGCGTGTGTCTGGTGCAATGGCGA
GAAGGTCTCGACGCCGTAGTTTAGGACGGCCCTCCGGCGCACACAGACCACGTTACCGCT
178 FQSCGIKSCREAACVWCNGE 197 

GGAATACCGCGGCGCGGTAGACCGCACGGAGTCAGGGCGCGAGTGCCAGCGCTGGGATCT 

601 + + + + + + 660 

CCTTATGGCGCCGCGCCATCTGGCGTGCCTCAGTCCCGCGCTCACGGTCGCGACCCTAGA 

198 EYRGAVDRTESGRECQRWDL 217

TCAGCACCCGCACCAGCACCCCTTCGAGCCGGGCAAGTTCCTCGACCAAGGTCTGGACGA

AGTCGTGGGCGTGGTCGTGGGGAAGCTCGGCCCGTTCAAGGAGCTGGTTCCAGACCTGCT

218 QHPHQHPFEPGKFLDQGLDD 237

caactattgccggaatcctgacggctccgagcggccatggtgctacactacggatccgca
gttgataacggccttaggactgccgaggctcgccggtaccacgatgtgatgcctaggcgt
238 nycrnpdgserpwcyttdpq 257

gatcgagcgagagttctgtgacctcccccgctgcgggtccgaggcacagccccgcctcga
ctagctcgctctcaagacactggagggggcgacgcccaggctccgtgtcggggcggagct
258 ierefcdlprcgseaqprle 277

gggcggtggcggttctggtggcggtggctccggcggtggcggttctctagagggacaaag
cccgccaccgccaagaccaccgccaccgaggccgccaccgccaagagatctccctgtttc

278 GGGGSGGGGSGGGGSLEGQR 297

GAAAAGAAGAAATACAATTCATGAATTCAAAAAATCAGCAAAGACTACCCTAATCAAAAT

901 + + + + + + 960

CTTTTCTTCTTTATGTTAAGTACTTAAGTTTTTTAGTCGTTTCTGATGGGATTAGTTTTA 

298 KRRNTIHEFKKSAKTTLIKI 317 

AGATCCAGCACTGAAGATAAAAACCAAAAAAGTGAATACTGCAGACCAATGTGCTAATAG 

961 + + + + + + 1020

TCTAGGTCGTGACTTCTATTTTTGGTTTTTTCACTTATGACGTCTGGTTACACGATTATC

318 DPALKIKTKKVNTADQCANR 337

ATGTACTAGGAATAAAGGACTTCCATTCACTTGCAAGGCTTTTGTTTTTGATAAAGCAAG
TACATGATCCTTATTTCCTGAAGGTAAGTGAACGTTCCGAAAACAAAAACTATTTCGTTC

338 ctrnkglpftckafvfdkar 357 

aaaacaatgcctctggttccccttcaatagcatgtcaagtggagtgaaaaaagaatttgg 

1081 + + + + + + 1140

ttttgttacggagaccaaggggaagttatcgtacagttcacctcacttttttcttaaacc

358 KQCLWFPFNSMSSGVKKEFG 377

CCATGAATTTGACCTCTATGAAAACAAAGACTACATTAGAAACTGCATCATTGGTAAAGG
1141 + + + + + + 1200

ggtacttaaactggagatacttttgtttctgatgtaatctttgacgtagtaaccatttcc
378 hefdlyenkdyirnciigkg 397

(continued) 
(continued)

ACGCAGCTACAAGGGAACAGTATCTATCACTAAGAGTGGCATCAAATGTCAGCCCTGGAG 

1201 + + + + + + 1260

TGCGTCGATGTTCCCTTGTCATAGATAGTGATTCTCACCGTAGTTTACAGTCGGGACCTC

398 RSYKGTVSITKSGIKCQPWS 417

TTCCATGATACCACACGAACACAGCTATCGGGGTAAAGACCTACAGGAAAACTACTGTCG 

1261 + + + + + + 1320

AAGGTACTATGGTGTGCTTGTGTCGATAGCCCCATTTCTGGATGTCCTTTTGATGACAGC

418 SMIPHEHSYRGKDLQENYCR 437

AAATCCTCGAGGGGAAGAAGGGGGACCCTGGTGTTTCACAAGCAATCCAGAGGTACGCTA

1321 + + + + + + 1380

TTTAGGAGCTCCCCTTCTTCCCCCTGGGACCACAAAGTGTTCGTTAGGTCTCCATGCGAT

438 NPRGEEGGPWCFTSNPEVRY 457

CGAAGTCTGTGACATTCCTCAGTGTTCAGAAGTTGAATGCATGACCTGCAATGGGGAGAG

1381 + + + + + + 1440

GCTTCAGACACTGTAAGGAGTCACAAGTCTTCAACTTACGTACTGGACGTTACCCCTCTC

458 EVCDIPQCSEVECMTCNGES 477

TTATCGAGGTCTCATGGATCATACAGAATCAGGCAAGATTTGTCAGCGCTGGGATCATCA
AATAGCTCCAGAGTACCTAGTATGTCTTAGTCCGTTCTAAACAGTCGCGACCCTAGTAGT
478 YRGLMDHTESGKICQRWDHQ 497

GACACCACACCGGCACAAATTCTTGCCTGAAAGATATCCCGACAAGGGCTTTGATGATAA

1501 + + + + + + 1560

CTGTGGTGTGGCCGTGTTTAAGAACGGACTTTCTATAGGGCTGTTCCCGAAACTACTATT

498 TPHRHKFLPERYPDKGFDDN 517

TTATTGCCGCAATCCCGATGGCCAGCCGAGGCCATGGTGCTATACTCTTGACCCTCACAC
AATAACGGCGTTAGGGCTACCGGTCGGCTCCGGTACCACGATATGAGAACTGGGAGTGTG
518 YCRNPDGQPRPWCYTLDPHT 537

CCGCTGGGAGTACTGTGCAATTAAAACATGCGCTGACAAAGCTGACGACGACGACAAACA
GGCGACCCTCATGACACGTTAATTTTGTACGCGACTGTTTCGACTGCTGCTGCTGTTTGT
538 RWEYCAIKTCADKADDDDKH 557

CCACCACCACCACCACCACTAGGGTCGAC

1681 + + 1709

GGTGGTGGTGGTGGTGGTGATCCCAGCTG

558 HHHHHH* 563
FIG 3a

[Schematic of the construct. Labels: Leader sequence, HL, K1, K2, Linker; domain source: HGF]
FIG 3b

GGATCCGCCAGCCCGTCCAGCAGCACCATGTGGGTGACCAAACTCCTGCCAGCCCTGCTG

1 + + + + + + 60

CCTAGGCGGTCGGGCAGGTCGTCGTGGTACACCCACTGGTTTGAGGACGGTCGGGACGAC

1 MWVTKLLPALL 11



CTGCAGCATGTCCTCCTGCATCTCCTCCTGCTCCCCATCGCCATCCCCTATGCAGAGGGA 

61 + + + + + + 120

GACGTCGTACAGGAGGACGTAGAGGAGGACGAGGGGTAGCGGTAGGGGATACGTCTCCCT

12 LQHVLLHLLLLPIAIPYAEG 31

CAAAGGAAAAGAAGAAATACAATTCATGAATTCAAAAAATCAGCAAAGACTACCCTAATC 

121 + + + + + + 180

GTTTCCTTTTCTTCTTTATGTTAAGTACTTAAGTTTTTTAGTCGTTTCTGATGGGATTAG

32 QRKRRNTIHEFKKSAKTTLI 51

AAAATAGATCCAGCACTGAAGATAAAAACCAAAAAAGTGAATACTGCAGACCAATGTGCT

181 + + + + + + 240

TTTTATCTAGGTCGTGACTTCTATTTTTGGTTTTTTCACTTATGACGTCTGGTTACACGA

52 KIDPALKIKTKKVNTADQCA 71

AATAGATGTACTAGGAATAAAGGACTTCCATTCACTTGCAAGGCTTTTGTTTTTGATAAA

241 + + + + + + 300

TTATCTACATGATCCTTATTTCCTGAAGGTAAGTGAACGTTCCGAAAACAAAAACTATTT

72 NRCTRNKGLPFTCKAFVFDK 91 

GCAAGAAAACAATGCCTCTGGTTCCCCTTCAATAGCATGTCAAGTGGAGTGAAAAAAGAA 

301 + + + + + + 360 

CGTTCTTTTGTTACGGAGACCAAGGGGAAGTTATCGTACAGTTCACCTCACTTTTTTCTT 

92 ARKQCLWFPFNSMSSGVKKE 111

TTTGGCCATGAATTTGACCTCTATGAAAACAAAGACTACATTAGAAACTGCATCATTGGT
AAACCGGTACTTAAACTGGAGATACTTTTGTTTCTGATGTAATCTTTGACGTAGTAACCA
112 FGHEFDLYENKDYIRNCIIG 131

AAAGGACGCAGCTACAAGGGAACAGTATCTATCACTAAGAGTGGCATCAAATGTCAGCCC
421 + + + + + + 480

TTTCCTGCGTCGATGTTCCCTTGTCATAGATAGTGATTCTCACCGTAGTTTACAGTCGGG
132 KGRSYKGTVSITKSGIKCQP 151

TGGAGTTCCATGATACCACACGAACACAGCTATCGGGGTAAAGACCTACAGGAAAACTAC

481 + + + + + + 540

ACCTCAAGGTACTATGGTGTGCTTGTGTCGATAGCCCCATTTCTGGATGTCCTTTTGATG
152 WSSMIPHEHSYRGKDLQENY 171



(continued) 
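TGTCGAAATCCTCGAGGGGAAGAAGGGGGACCCTGGTGTTTCACAAGCAATCCAGAGGTA

541 + + + + + + 600

ACAGCTTTAGGAGCTCCCCTTCTTCCCCCTGGGACCACAAAGTGTTCGTTAGGTCTCCAT

172 CRNPRGEEGGPWCFTSNPEV 191

CGCTACGAAGTCTGTGACATTCCTCAGTGTTCAGAAGTTGAATGCATGACCTGCAATGGG

601 + + + + + + 660

GCGATGCTTCAGACACTGTAAGGAGTCACAAGTCTTCAACTTACGTACTGGACGTTACCC

192 RYEVCDIPQCSEVECMTCNG 211

GAGAGTTATCGAGGTCTCATGGATCATACAGAATCAGGCAAGATTTGTCAGCGCTGGGAT

661 + + + + + + 720

CTCTCAATAGCTCCAGAGTACCTAGTATGTCTTAGTCCGTTCTAAACAGTCGCGACCCTA

212 ESYRGLMDHTESGKICQRWD 231

CATCAGACACCACACCGGCACAAATTCTTGCCTGAAAGATATCCCGACAAGGGCTTTGAT

721 + + + + + + 780

GTAGTCTGTGGTGTGGCCGTGTTTAAGAACGGACTTTCTATAGGGCTGTTCCCGAAACTA

232 HQTPHRHKFLPERYPDKGFD 251

GATAATTATTGCCGCAATCCCGATGGCCAGCCGAGGCCATGGTGCTATACTCTTGACCCT

781 + + + + + + 840

CTATTAATAACGGCGTTAGGGCTACCGGTCGGCTCCGGTACCACGATATGAGAACTGGGA

252 DNYCRNPDGQPRPWCYTLDP 271

CACACCCGCTGGGAGTACTGTGCAATTAAAACATGCGCTGACAAAGCTTCGGGCGGTGGC

841 + + + + + + 900

GTGTGGGCGACCCTCATGACACGTTAATTTTGTACGCGACTGTTTCGAAGCCCGCCACCG

272 HTRWEYCAIKTCADKASGGG 291

GGTTCTGGTGGCGGTGGCTCCGGCGGTGGCGGTTCTCTAGAGGGACAAAGGAAAAGAAGA

901 + + + + + + 960

CCAAGACCACCGCCACCGAGGCCGCCACCGCCAAGAGATCTCCCTGTTTCCTTTTCTTCT

292 GSGGGGSGGGGSLEGQRKRR 311

AATACAATTCATGAATTCAAAAAATCAGCAAAGACTACCCTAATCAAAATAGATCCAGCA

961 + + + + + + 1020

TTATGTTAAGTACTTAAGTTTTTTAGTCGTTTCTGATGGGATTAGTTTTATCTAGGTCGT

312 NTIHEFKKSAKTTLIKIDPA 331

CTGAAGATAAAAACCAAAAAAGTGAATACTGCAGACCAATGTGCTAATAGATGTACTAGG

1021 + + + + + + 1080

GACTTCTATTTTTGGTTTTTTCACTTATGACGTCTGGTTACACGATTATCTACATGATCC

332 LKIKTKKVNTADQCANRCTR 351

AATAAAGGACTTCCATTCACTTGCAAGGCTTTTGTTTTTGATAAAGCAAGAAAACAATGC

1081 + + + + + + 1140

TTATTTCCTGAAGGTAAGTGAACGTTCCGAAAACAAAAACTATTTCGTTCTTTTGTTACG

352 NKGLPFTCKAFVFDKARKQC 371

CTCTGGTTCCCCTTCAATAGCATGTCAAGTGGAGTGAAAAAAGAATTTGGCCATGAATTT

1141 + + + + + + 1200

GAGACCAAGGGGAAGTTATCGTACAGTTCACCTCACTTTTTTCTTAAACCGGTACTTAAA

372 LWFPFNSMSSGVKKEFGHEF 391

(continued)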
GACCTCTATGAAAACAAAGACTACATTAGAAACTGCATCATTGGTAAAGGACGCAGCTAC

1201 + + + + + + 1260

CTGGAGATACTTTTGTTTCTGATGTAATCTTTGACGTAGTAACCATTTCCTGCGTCGATG

392 DLYENKDYIRNCIIGKGRSY 411

AAGGGAACAGTATCTATCACTAAGAGTGGCATCAAATGTCAGCCCTGGAGTTCCATGATA

1261 + + + + + + 1320

TTCCCTTGTCATAGATAGTGATTCTCACCGTAGTTTACAGTCGGGACCTCAAGGTACTAT

412 KGTVSITKSGIKCQPWSSMI 431

CCACACGAACACAGCTATCGGGGTAAAGACCTACAGGAAAACTACTGTCGAAATCCTCGA

1321 + + + + + + 1380

GGTGTGCTTGTGTCGATAGCCCCATTTCTGGATGTCCTTTTGATGACAGCTTTAGGAGCT

432 PHEHSYRGKDLQENYCRNPR 451

GGGGAAGAAGGGGGACCCTGGTGTTTCACAAGCAATCCAGAGGTACGCTACGAAGTCTGT

1381 + + + + + + 1440

CCCCTTCTTCCCCCTGGGACCACAAAGTGTTCGTTAGGTCTCCATGCGATGCTTCAGACA

452 GEEGGPWCFTSNPEVRYEVC 471

GACATTCCTCAGTGTTCAGAAGTTGAATGCATGACCTGCAATGGGGAGAGTTATCGAGGT

1441 + + + + + + 1500

CTGTAAGGAGTCACAAGTCTTCAACTTACGTACTGGACGTTACCCCTCTCAATAGCTCCA

472 DIPQCSEVECMTCNGESYRG 491

CTCATGGATCATACAGAATCAGGCAAGATTTGTCAGCGCTGGGATCATCAGACACCACAC

1501 + + + + + + 1560

GAGTACCTAGTATGTCTTAGTCCGTTCTAAACAGTCGCGACCCTAGTAGTCTGTGGTGTG

492 LMDHTESGKICQRWDHQTPH 511

CGGCACAAATTCTTGCCTGAAAGATATCCCGACAAGGGCTTTGATGATAATTATTGCCGC

1561 + + + + + + 1620

GCCGTGTTTAAGAACGGACTTTCTATAGGGCTGTTCCCGAAACTACTATTAATAACGGCG

512 RHKFLPERYPDKGFDDNYCR 531

AATCCCGATGGCCAGCCGAGGCCATGGTGCTATACTCTTGACCCTCACACCCGCTGGGAG

1621 + + + + + + 1680

TTAGGGCTACCGGTCGGCTCCGGTACCACGATATGAGAACTGGGAGTGTGGGCGACCCTC

532 NPDGQPRPWCYTLDPHTRWE 551

TACTGTGCAATTAAAACATGCGCTGACAAAGCTGACGACGACGACAAACACCACCACCAC

1681 + + + + + + 1740

ATGACACGTTAATTTTGTACGCGACTGTTTCGACTGCTGCTGCTGTTTGTGGTGGTGGTG

552 YCAIKTCADKADDDDKHHHH 571

CACCACCACTAGGGTCGAC

1741 + 1759

GTGGTGGTGATCCCAGCTG

572 HHH* 574
Fig 6

1:10 1:2.5

METRON

Fig 7
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: DOMPE' S.p.A. 

(B) STREET: Via Campo di Pile

(C) CITY: L'AQUILA 

(E) COUNTRY: ITALY 

(F) POSTAL CODE (ZIP): 67100 

(ii) TITLE OF INVENTION: RECOMBINANT PROTEINS DERIVED FROM HGF AND MSP
(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
ATGTGGGTGA CCAAACTCCT GCCAGCCCTG CTGCTGCAGC ATGTCCTCCT GCATCTCCTC 60
CTGCTCCCCA TCGCCATCCC CTATGCAGAG GGACAAAGGA AAAGAAGAAA TACAATTCAT 120
GAATTCAAAA AATCAGCAAA GACTACCCTA ATCAAAATAG ATCCAGCACT GAAGATAAAA 180
ACCAAAAAAG TGAATACTGC AGACCAATGT GCTAATAGAT GTACTAGGAA TAAAGGACTT 240
CCATTCACTT GCAAGGCTTT TGTTTTTGAT AAAGCAAGAA AACAATGCCT CTGGTTCCCC 300
TTCAATAGCA TGTCAAGTGG AGTGAAAAAA GAATTTGGCC ATGAATTTGA CCTCTATGAA 360
AACAAAGACT ACATTAGAAA CTGCATCATT GGTAAAGGAC GCAGCTACAA GGGAACAGTA 420
TCTATCACTA AGAGTGGCAT CAAATGTCAG CCCTGGAGTT CCATGATACC ACACGAACAC 480
AGCTATCGGG GTAAAGACCT ACAGGAAAAC TACTGTCGAA ATCCTCGAGG GGAAGAAGGG 540
GGACCCTGGT GTTTCACAAG CAATCCAGAG GTACGCTACG AAGTCTGTGA CATTCCTCAG 600
TGTTCAGAAG TTGAATGCAT GACCTGCAAT GGGGAGAGTT ATCGAGGTCT CATGGATCAT 660
ACAGAATCAG GCAAGATTTG TCAGCGCTGG GATCATCAGA CACCACACCG GCACAAATTC 720
TTGCCTGAAA GATATCCCGA CAAGGGCTTT GATGATAATT ATTGCCGCAA TCCCGATGGC 780
CAGCCGAGGC CATGGTGCTA TACTCTTGAC CCTCACACCC GCTGGGAGTA CTGTGCAATT 840
AAAACATGCG CTGACAAAGC TTCGGGCGGT GGCGGTTCTG GTGGCGGTGG CTCCGGCGGT 900
GGCGGTTCTC TAGAGGGACA AAGGAAAAGA AGAAATACAA TTCATGAATT CAAAAAATCA 960
GCAAAGACTA CCCTAATCAA AATAGATCCA GCACTGAAGA TAAAAACCAA AAAAGTGAAT 1020
ACTGCAGACC AATGTGCTAA TAGATGTACT AGGAATAAAG GACTTCCATT CACTTGCAAG 1080
GCTTTTGTTT TTGATAAAGC AAGAAAACAA TGCCTCTGGT TCCCCTTCAA TAGCATGTCA 1140
AGTGGAGTGA AAAAAGAATT TGGCCATGAA TTTGACCTCT ATGAAAACAA AGACTACATT 1200
AGAAACTGCA TCATTGGTAA AGGACGCAGC TACAAGGGAA CAGTATCTAT CACTAAGAGT 1260
GGCATCAAAT GTCAGCCCTG GAGTTCCATG ATACCACACG AACACAGCTA TCGGGGTAAA 1320
GACCTACAGG AAAACTACTG TCGAAATCCT CGAGGGGAAG AAGGGGGACC CTGGTGTTTC 1380
ACAAGCAATC CAGAGGTACG CTACGAAGTC TGTGACATTC CTCAGTGTTC AGAAGTTGAA 1440
TGCATGACCT GCAATGGGGA GAGTTATCGA GGTCTCATGG ATCATACAGA ATCAGGCAAG 1500
ATTTGTCAGC GCTGGGATCA TCAGACACCA CACCGGCACA AATTCTTGCC TGAAAGATAT 1560
CCCGACAAGG GCTTTGATGA TAATTATTGC CGCAATCCCG ATGGCCAGCC GAGGCCATGG 1620
TGCTATACTC TTGACCCTCA CACCCGCTGG GAGTACTGTG CAATTAAAAC ATGCGCTGAC 1680
AAAGCTGACG ACGACGACAA ACACCACCAC CACCACCACC ACTAG 1725
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Trp Val Thr Lys Leu Leu Pro Ala Leu Leu Leu Gln His Val Leu
1 5 10 15

Leu His Leu Leu Leu Leu Pro Ile Ala Ile Pro Tyr Ala Glu Gly Gln
20 25 30

Arg Lys Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser Ala Lys Thr
35 40 45

Thr Leu Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr Lys Lys Val
50 55 60

Asn Thr Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn Lys Gly Leu
65 70 75 80

Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg Lys Gln Cys
85 90 95

Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys Lys Glu Phe
100 105 110

Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr Ile Arg Asn Cys
115 120 125

Ile Ile Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser Ile Thr Lys
130 135 140

Ser Gly Ile Lys Cys Gln Pro Trp Ser Ser Met Ile Pro His Glu His
145 150 155 160

Ser Tyr Arg Gly Lys Asp Leu Gln Glu Asn Tyr Cys Arg Asn Pro Arg
165 170 175

Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro Glu Val Arg
180 185 190

Tyr Glu Val Cys Asp Ile Pro Gln Cys Ser Glu Val Glu Cys Met Thr
195 200 205

Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr Glu Ser Gly
210 215 220
Lys Ile Cys Gln Arg Trp Asp His Gln Thr Pro His Arg His Lys Phe
225 230 235 240

Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn Tyr Cys Arg
245 250 255

Asn Pro Asp Gly Gln Pro Arg Pro Trp Cys Tyr Thr Leu Asp Pro His
260 265 270

Thr Arg Trp Glu Tyr Cys Ala Ile Lys Thr Cys Ala Asp Lys Ala Ser
275 280 285

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Leu
290 295 300

Glu Gly Gln Arg Lys Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser
305 310 315 320

Ala Lys Thr Thr Leu Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr
325 330 335

Lys Lys Val Asn Thr Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn
340 345 350

Lys Gly Leu Pro Phe Thr Cys Lys Ala Phe Val Phe Asp Lys Ala Arg
355 360 365

Lys Gln Cys Leu Trp Phe Pro Phe Asn Ser Met Ser Ser Gly Val Lys
370 375 380

Lys Glu Phe Gly His Glu Phe Asp Leu Tyr Glu Asn Lys Asp Tyr Ile
385 390 395 400

Arg Asn Cys Ile Ile Gly Lys Gly Arg Ser Tyr Lys Gly Thr Val Ser
405 410 415

Ile Thr Lys Ser Gly Ile Lys Cys Gln Pro Trp Ser Ser Met Ile Pro
420 425 430

His Glu His Ser Tyr Arg Gly Lys Asp Leu Gln Glu Asn Tyr Cys Arg
435 440 445

Asn Pro Arg Gly Glu Glu Gly Gly Pro Trp Cys Phe Thr Ser Asn Pro
450 455 460

Glu Val Arg Tyr Glu Val Cys Asp Ile Pro Gln Cys Ser Glu Val Glu
465 470 475 480

Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly Leu Met Asp His Thr
485 490 495

Glu Ser Gly Lys Ile Cys Gln Arg Trp Asp His Gln Thr Pro His Arg
500 505 510

His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys Gly Phe Asp Asp Asn
515 520 525
Tyr Cys Arg Asn Pro Asp Gly Gln Pro Arg Pro Trp Cys Tyr Thr Leu
530 535 540

Asp Pro His Thr Arg Trp Glu Tyr Cys Ala Ile Lys Thr Cys Ala Asp
545 550 555 560

Lys Ala Asp Asp Asp Asp Lys His His His His His His His
565 570

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



ATGGGGTGGC TCCCACTCCT GCTGCTTCTG ACTCAATGCT TAGGGGTCCC TGGGCAGCGC 60
TCGCCATTGA ATGACTTCCA AGTGCTCCGG GGCACAGAGC TACAGCACCT GCTACATGCG 120
GTGGTGCCCG GGCCTTGGCA GGAGGATGTG GCAGATGCTG AAGAGTGTGC TGGTCGCTGT 180
GGGCCCTTAA TGGACTGCCG GGCCTTCCAC TACAACGTGA GCAGCCATGG TTGCCAACTG 240
CTGCCATGGA CTCAACACTC GCCCCACACG AGGCTGCGGC GTTCTGGGCG CTGTGACCTC 300
TTCCAGAAGA AAGACTACGT ACGGACCTGC ATCATGAACA ATGGGGTTGG GTACCGGGGC 360
ACCATGGCCA CGACCGTGGG TGGCCTGCCC TGCCAGGCTT GGAGCCACAA GTTCCCGAAT 420
GATCACAAGT ACACGCCCAC TCTCCGGAAT GGCCTGGAAG AGAACTTCTG CCGTAACCCT 480
GATGGCGACC CCGGAGGTCC TTGGTGCTAC ACAACAGACC CTGCTGTGCG CTTCCAGAGC 540
TGCGGCATCA AATCCTGCCG GGAGGCCGCG TGTGTCTGGT GCAATGGCGA GGAATACCGC 600
GGCGCGGTAG ACCGCACGGA GTCAGGGCGC GAGTGCCAGC GCTGGGATCT TCAGCACCCG 660
CACCAGCACC CCTTCGAGCC GGGCAAGTTC CTCGACCAAG GTCTGGACGA CAACTATTGC 720
CGGAATCCTG ACGGCTCCGA GCGGCCATGG TGCTACACTA CGGATCCGCA GATCGAGCGA 780
GAGTTCTGTG ACCTCCCCCG CTGCGGGTCC GAGGCACAGC CCCGCCTCGA GGGCGGTGGC 840
GGTTCTGGTG GCGGTGGCTC CGGCGGTGGC GGTTCTCTAG AGGGACAAAG GAAAAGAAGA 900
AATACAATTC ATGAATTCAA AAAATCAGCA AAGACTACCC TAATCAAAAT AGATCCAGCA 960
CTGAAGATAA AAACCAAAAA AGTGAATACT GCAGACCAAT GTGCTAATAG ATGTACTAGG 1020
AATAAAGGAC TTCCATTCAC TTGCAAGGCT TTTGTTTTTG ATAAAGCAAG AAAACAATGC 1080
CTCTGGTTCC CCTTCAATAG CATGTCAAGT GGAGTGAAAA AAGAATTTGG CCATGAATTT 1140
GACCTCTATG AAAACAAAGA CTACATTAGA AACTGCATCA TTGGTAAAGG ACGCAGCTAC 1200
AAGGGAACAG TATCTATCAC TAAGAGTGGC ATCAAATGTC AGCCCTGGAG TTCCATGATA 1260
CCACACGAAC ACAGCTATCG GGGTAAAGAC CTACAGGAAA ACTACTGTCG AAATCCTCGA 1320
GGGGAAGAAG GGGGACCCTG GTGTTTCACA AGCAATCCAG AGGTACGCTA CGAAGTCTGT 1380
GACATTCCTC AGTGTTCAGA AGTTGAATGC ATGACCTGCA ATGGGGAGAG TTATCGAGGT 1440
CTCATGGATC ATACAGAATC AGGCAAGATT TGTCAGCGCT GGGATCATCA GACACCACAC 1500
CGGCACAAAT TCTTGCCTGA AAGATATCCC GACAAGGGCT TTGATGATAA TTATTGCCGC 1560
AATCCCGATG GCCAGCCGAG GCCATGGTGC TATACTCTTG ACCCTCACAC CCGCTGGGAG 1620
TACTGTGCAA TTAAAACATG CGCTGACAAA GCTGACGACG ACGACAAACA CCACCACCAC 1680
CACCACCACT AG 1692



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Trp Leu Pro Leu Leu Leu Leu Leu Thr Gln Cys Leu Gly Val
1 5 10 15

Pro Gly Gln Arg Ser Pro Leu Asn Asp Phe Gln Val Leu Arg Gly Thr
20 25 30

Glu Leu Gln His Leu Leu His Ala Val Val Pro Gly Pro Trp Gln Glu
35 40 45

Asp Val Ala Asp Ala Glu Glu Cys Ala Gly Arg Cys Gly Pro Leu Met
50 55 60

Asp Cys Arg Ala Phe His Tyr Asn Val Ser Ser His Gly Cys Gln Leu
65 70 75 80
Leu Pro Trp Thr Gln His Ser Pro His Thr Arg Leu Arg Arg Ser Gly
85 90 95

Arg Cys Asp Leu Phe Gln Lys Lys Asp Tyr Val Arg Thr Cys Ile Met
100 105 110

Asn Asn Gly Val Gly Tyr Arg Gly Thr Met Ala Thr Thr Val Gly Gly
115 120 125

Leu Pro Cys Gln Ala Trp Ser His Lys Phe Pro Asn Asp His Lys Tyr
130 135 140

Thr Pro Thr Leu Arg Asn Gly Leu Glu Glu Asn Phe Cys Arg Asn Pro
145 150 155 160

Asp Gly Asp Pro Gly Gly Pro Trp Cys Tyr Thr Thr Asp Pro Ala Val
165 170 175

Arg Phe Gln Ser Cys Gly Ile Lys Ser Cys Arg Glu Ala Ala Cys Val
180 185 190

Trp Cys Asn Gly Glu Glu Tyr Arg Gly Ala Val Asp Arg Thr Glu Ser
195 200 205

Gly Arg Glu Cys Gln Arg Trp Asp Leu Gln His Pro His Gln His Pro
210 215 220

Phe Glu Pro Gly Lys Phe Leu Asp Gln Gly Leu Asp Asp Asn Tyr Cys
225 230 235 240

Arg Asn Pro Asp Gly Ser Glu Arg Pro Trp Cys Tyr Thr Thr Asp Pro
245 250 255

Gln Ile Glu Arg Glu Phe Cys Asp Leu Pro Arg Cys Gly Ser Glu Ala
260 265 270

Gln Pro Arg Leu Glu Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
275 280 285

Gly Gly Gly Ser Leu Glu Gly Gln Arg Lys Arg Arg Asn Thr Ile His
290 295 300

Glu Phe Lys Lys Ser Ala Lys Thr Thr Leu Ile Lys Ile Asp Pro Ala
305 310 315 320

Leu Lys Ile Lys Thr Lys Lys Val Asn Thr Ala Asp Gln Cys Ala Asn
325 330 335

Arg Cys Thr Arg Asn Lys Gly Leu Pro Phe Thr Cys Lys Ala Phe Val
340 345 350

Phe Asp Lys Ala Arg Lys Gln Cys Leu Trp Phe Pro Phe Asn Ser Met
355 360 365

Ser Ser Gly Val Lys Lys Glu Phe Gly His Glu Phe Asp Leu Tyr Glu
370 375 380
Asn Lys Asp Tyr Ile Arg Asn Cys Ile Ile Gly Lys Gly Arg Ser Tyr
385 390 395 400

Lys Gly Thr Val Ser Ile Thr Lys Ser Gly Ile Lys Cys Gln Pro Trp
405 410 415

Ser Ser Met Ile Pro His Glu His Ser Tyr Arg Gly Lys Asp Leu Gln
420 425 430

Glu Asn Tyr Cys Arg Asn Pro Arg Gly Glu Glu Gly Gly Pro Trp Cys
435 440 445

Phe Thr Ser Asn Pro Glu Val Arg Tyr Glu Val Cys Asp Ile Pro Gln
450 455 460

Cys Ser Glu Val Glu Cys Met Thr Cys Asn Gly Glu Ser Tyr Arg Gly
465 470 475 480

Leu Met Asp His Thr Glu Ser Gly Lys Ile Cys Gln Arg Trp Asp His
485 490 495

Gln Thr Pro His Arg His Lys Phe Leu Pro Glu Arg Tyr Pro Asp Lys
500 505 510

Gly Phe Asp Asp Asn Tyr Cys Arg Asn Pro Asp Gly Gln Pro Arg Pro
515 520 525

Trp Cys Tyr Thr Leu Asp Pro His Thr Arg Trp Glu Tyr Cys Ala Ile
530 535 540

Lys Thr Cys Ala Asp Lys Ala Asp Asp Asp Asp Lys His His His His
545 550 555 560

His His His



